Lecture 23

Dataset used in class for multiple regression. The data are from the Los Angeles Pollution Control District for 30 days one summer and I found them in Rice: Mathematical Statistics and Data Analysis. The meteorological variables are the average of morning readings at four stations. The oxidant level are the maximum level observed for each day.

Day WindSped Temp Humidity Insolat Oxidant
1 50 77 67 78 15
2 47 80 66 77 20
3 57 75 77 73 13
4 38 72 73 69 21
5 52 71 75 78 12
6 57 74 75 80 12
7 53 78 64 75 12
8 62 82 59 78 11
9 52 82 60 75 12
10 42 82 62 58 20
11 47 82 59 76 11
12 40 80 66 76 17
13 42 81 68 71 20
14 40 85 62 74 23
15 48 82 70 73 17
16 50 79 66 72 16
17 55 72 63 69 10
18 52 72 61 57 11
19 48 76 60 74 11
20 52 77 59 72 9
21 52 73 58 67 5
22 48 68 63 30 5
23 65 67 65 23 4
24 53 71 53 72 7
25 36 75 54 78 18
26 45 81 44 81 17
27 43 84 46 78 23
28 42 83 43 78 23
29 35 87 44 77 24
30 43 92 35 79 25