mirror of https://github.com/01-edu/Branch-AI.git
eslopfer
2 years ago
3 changed files with 0 additions and 581315 deletions
@ -1,301 +0,0 @@ |
|||||||
The Forest CoverType dataset |
|
||||||
|
|
||||||
|
|
||||||
1. Title of Database: |
|
||||||
|
|
||||||
Forest Covertype data |
|
||||||
|
|
||||||
|
|
||||||
2. Sources: |
|
||||||
|
|
||||||
(a) Original owners of database: |
|
||||||
Remote Sensing and GIS Program |
|
||||||
Department of Forest Sciences |
|
||||||
College of Natural Resources |
|
||||||
Colorado State University |
|
||||||
Fort Collins, CO 80523 |
|
||||||
(contact Jock A. Blackard, jblackard 'at' fs.fed.us |
|
||||||
or Dr. Denis J. Dean, denis.dean 'at' utdallas.edu) |
|
||||||
|
|
||||||
NOTE: Reuse of this database is unlimited with retention of |
|
||||||
copyright notice for Jock A. Blackard and Colorado |
|
||||||
State University. |
|
||||||
|
|
||||||
(b) Donors of database: |
|
||||||
Jock A. Blackard (jblackard 'at' fs.fed.us) |
|
||||||
GIS Coordinator |
|
||||||
USFS - Forest Inventory & Analysis |
|
||||||
Rocky Mountain Research Station |
|
||||||
507 25th Street |
|
||||||
Ogden, UT 84401 |
|
||||||
|
|
||||||
Dr. Denis J. Dean (denis.dean 'at' utdallas.edu) |
|
||||||
Professor |
|
||||||
Program in Geography and Geospatial Sciences |
|
||||||
School of Economic, Political and Policy Sciences |
|
||||||
800 West Campbell Rd |
|
||||||
Richardson, TX 75080-3021 |
|
||||||
|
|
||||||
Dr. Charles W. Anderson (anderson 'at' cs.colostate.edu) |
|
||||||
Associate Professor |
|
||||||
Department of Computer Science |
|
||||||
Colorado State University |
|
||||||
Fort Collins, CO 80523 USA |
|
||||||
|
|
||||||
(c) Date donated: August 1998 |
|
||||||
|
|
||||||
|
|
||||||
3. Past Usage: |
|
||||||
|
|
||||||
Blackard, Jock A. and Denis J. Dean. 2000. "Comparative |
|
||||||
Accuracies of Artificial Neural Networks and Discriminant |
|
||||||
Analysis in Predicting Forest Cover Types from Cartographic |
|
||||||
Variables." Computers and Electronics in Agriculture |
|
||||||
24(3):131-151. |
|
||||||
|
|
||||||
Blackard, Jock A. and Denis J. Dean. 1998. "Comparative |
|
||||||
Accuracies of Neural Networks and Discriminant Analysis |
|
||||||
in Predicting Forest Cover Types from Cartographic |
|
||||||
Variables." Second Southern Forestry GIS Conference. |
|
||||||
University of Georgia. Athens, GA. Pages 189-199. |
|
||||||
|
|
||||||
Blackard, Jock A. 1998. "Comparison of Neural Networks and |
|
||||||
Discriminant Analysis in Predicting Forest Cover Types." |
|
||||||
Ph.D. dissertation. Department of Forest Sciences. |
|
||||||
Colorado State University. Fort Collins, Colorado. |
|
||||||
165 pages. |
|
||||||
|
|
||||||
Abstract of dissertation: |
|
||||||
Natural resource managers responsible for developing |
|
||||||
ecosystem management strategies require basic descriptive |
|
||||||
information including inventory data for forested lands to |
|
||||||
support their decision-making processes. However, managers |
|
||||||
generally do not have this type of data for inholdings or |
|
||||||
neighboring lands that are outside their immediate |
|
||||||
jurisdiction. One method of obtaining this information is |
|
||||||
through the use of predictive models. |
|
||||||
Two predictive models were examined in this study, a |
|
||||||
feedforward neural network model and a more traditional |
|
||||||
statistical model based on discriminant analysis. The overall |
|
||||||
objectives of this research were to first construct these two |
|
||||||
predictive models, and second to compare and evaluate their |
|
||||||
respective classification accuracies when predicting forest |
|
||||||
cover types in undisturbed forests. |
|
||||||
The study area included four wilderness areas found in |
|
||||||
the Roosevelt National Forest of northern Colorado. A total |
|
||||||
of twelve cartographic measures were utilized as independent |
|
||||||
variables in the predictive models, while seven major forest |
|
||||||
cover types were used as dependent variables. Several subsets |
|
||||||
of these variables were examined to determine the best overall |
|
||||||
predictive model. |
|
||||||
For each subset of cartographic variables examined in |
|
||||||
this study, relative classification accuracies indicate the |
|
||||||
neural network approach outperformed the traditional |
|
||||||
discriminant analysis method in predicting forest cover types. |
|
||||||
The final neural network model had a higher absolute |
|
||||||
classification accuracy (70.58%) than the final corresponding |
|
||||||
linear discriminant analysis model(58.38%). In support of these |
|
||||||
classification results, thirty additional networks with randomly |
|
||||||
selected initial weights were derived. From these networks, the |
|
||||||
overall mean absolute classification accuracy for the neural |
|
||||||
network method was 70.52%, with a 95% confidence interval of |
|
||||||
70.26% to 70.80%. Consequently, natural resource managers may |
|
||||||
utilize an alternative method of predicting forest cover types |
|
||||||
that is both superior to the traditional statistical methods and |
|
||||||
adequate to support their decision-making processes for |
|
||||||
developing ecosystem management strategies. |
|
||||||
|
|
||||||
|
|
||||||
-- Classification performance |
|
||||||
-- first 11,340 records used for training data subset |
|
||||||
-- next 3,780 records used for validation data subset |
|
||||||
-- last 565,892 records used for testing data subset |
|
||||||
-- 70% Neural Network (backpropagation) |
|
||||||
-- 58% Linear Discriminant Analysis |
|
||||||
|
|
||||||
|
|
||||||
4. Relevant Information Paragraph: |
|
||||||
|
|
||||||
Predicting forest cover type from cartographic variables only |
|
||||||
(no remotely sensed data). The actual forest cover type for |
|
||||||
a given observation (30 x 30 meter cell) was determined from |
|
||||||
US Forest Service (USFS) Region 2 Resource Information System |
|
||||||
(RIS) data. Independent variables were derived from data |
|
||||||
originally obtained from US Geological Survey (USGS) and |
|
||||||
USFS data. Data is in raw form (not scaled) and contains |
|
||||||
binary (0 or 1) columns of data for qualitative independent |
|
||||||
variables (wilderness areas and soil types). |
|
||||||
|
|
||||||
This study area includes four wilderness areas located in the |
|
||||||
Roosevelt National Forest of northern Colorado. These areas |
|
||||||
represent forests with minimal human-caused disturbances, |
|
||||||
so that existing forest cover types are more a result of |
|
||||||
ecological processes rather than forest management practices. |
|
||||||
|
|
||||||
Some background information for these four wilderness areas: |
|
||||||
Neota (area 2) probably has the highest mean elevational value of |
|
||||||
the 4 wilderness areas. Rawah (area 1) and Comanche Peak (area 3) |
|
||||||
would have a lower mean elevational value, while Cache la Poudre |
|
||||||
(area 4) would have the lowest mean elevational value. |
|
||||||
|
|
||||||
As for primary major tree species in these areas, Neota would have |
|
||||||
spruce/fir (type 1), while Rawah and Comanche Peak would probably |
|
||||||
have lodgepole pine (type 2) as their primary species, followed by |
|
||||||
spruce/fir and aspen (type 5). Cache la Poudre would tend to have |
|
||||||
Ponderosa pine (type 3), Douglas-fir (type 6), and |
|
||||||
cottonwood/willow (type 4). |
|
||||||
|
|
||||||
The Rawah and Comanche Peak areas would tend to be more typical of |
|
||||||
the overall dataset than either the Neota or Cache la Poudre, due |
|
||||||
to their assortment of tree species and range of predictive |
|
||||||
variable values (elevation, etc.) Cache la Poudre would probably |
|
||||||
be more unique than the others, due to its relatively low |
|
||||||
elevation range and species composition. |
|
||||||
|
|
||||||
|
|
||||||
5. Number of instances (observations): 581,012 |
|
||||||
|
|
||||||
|
|
||||||
6. Number of Attributes: 12 measures, but 54 columns of data |
|
||||||
(10 quantitative variables, 4 binary |
|
||||||
wilderness areas and 40 binary |
|
||||||
soil type variables) |
|
||||||
|
|
||||||
|
|
||||||
7. Attribute information: |
|
||||||
|
|
||||||
Given is the attribute name, attribute type, the measurement unit and |
|
||||||
a brief description. The forest cover type is the classification |
|
||||||
problem. The order of this listing corresponds to the order of |
|
||||||
numerals along the rows of the database. |
|
||||||
|
|
||||||
Name Data Type Measurement Description |
|
||||||
|
|
||||||
Elevation quantitative meters Elevation in meters |
|
||||||
Aspect quantitative azimuth Aspect in degrees azimuth |
|
||||||
Slope quantitative degrees Slope in degrees |
|
||||||
Horizontal_Distance_To_Hydrology quantitative meters Horz Dist to nearest surface water features |
|
||||||
Vertical_Distance_To_Hydrology quantitative meters Vert Dist to nearest surface water features |
|
||||||
Horizontal_Distance_To_Roadways quantitative meters Horz Dist to nearest roadway |
|
||||||
Hillshade_9am quantitative 0 to 255 index Hillshade index at 9am, summer solstice |
|
||||||
Hillshade_Noon quantitative 0 to 255 index Hillshade index at noon, summer soltice |
|
||||||
Hillshade_3pm quantitative 0 to 255 index Hillshade index at 3pm, summer solstice |
|
||||||
Horizontal_Distance_To_Fire_Points quantitative meters Horz Dist to nearest wildfire ignition points |
|
||||||
Wilderness_Area (4 binary columns) qualitative 0 (absence) or 1 (presence) Wilderness area designation |
|
||||||
Soil_Type (40 binary columns) qualitative 0 (absence) or 1 (presence) Soil Type designation |
|
||||||
Cover_Type (7 types) integer 1 to 7 Forest Cover Type designation |
|
||||||
|
|
||||||
|
|
||||||
Code Designations: |
|
||||||
|
|
||||||
Wilderness Areas: 1 -- Rawah Wilderness Area |
|
||||||
2 -- Neota Wilderness Area |
|
||||||
3 -- Comanche Peak Wilderness Area |
|
||||||
4 -- Cache la Poudre Wilderness Area |
|
||||||
|
|
||||||
Soil Types: 1 to 40 : based on the USFS Ecological |
|
||||||
Landtype Units (ELUs) for this study area: |
|
||||||
|
|
||||||
Study Code USFS ELU Code Description |
|
||||||
1 2702 Cathedral family - Rock outcrop complex, extremely stony. |
|
||||||
2 2703 Vanet - Ratake families complex, very stony. |
|
||||||
3 2704 Haploborolis - Rock outcrop complex, rubbly. |
|
||||||
4 2705 Ratake family - Rock outcrop complex, rubbly. |
|
||||||
5 2706 Vanet family - Rock outcrop complex complex, rubbly. |
|
||||||
6 2717 Vanet - Wetmore families - Rock outcrop complex, stony. |
|
||||||
7 3501 Gothic family. |
|
||||||
8 3502 Supervisor - Limber families complex. |
|
||||||
9 4201 Troutville family, very stony. |
|
||||||
10 4703 Bullwark - Catamount families - Rock outcrop complex, rubbly. |
|
||||||
11 4704 Bullwark - Catamount families - Rock land complex, rubbly. |
|
||||||
12 4744 Legault family - Rock land complex, stony. |
|
||||||
13 4758 Catamount family - Rock land - Bullwark family complex, rubbly. |
|
||||||
14 5101 Pachic Argiborolis - Aquolis complex. |
|
||||||
15 5151 unspecified in the USFS Soil and ELU Survey. |
|
||||||
16 6101 Cryaquolis - Cryoborolis complex. |
|
||||||
17 6102 Gateview family - Cryaquolis complex. |
|
||||||
18 6731 Rogert family, very stony. |
|
||||||
19 7101 Typic Cryaquolis - Borohemists complex. |
|
||||||
20 7102 Typic Cryaquepts - Typic Cryaquolls complex. |
|
||||||
21 7103 Typic Cryaquolls - Leighcan family, till substratum complex. |
|
||||||
22 7201 Leighcan family, till substratum, extremely bouldery. |
|
||||||
23 7202 Leighcan family, till substratum - Typic Cryaquolls complex. |
|
||||||
24 7700 Leighcan family, extremely stony. |
|
||||||
25 7701 Leighcan family, warm, extremely stony. |
|
||||||
26 7702 Granile - Catamount families complex, very stony. |
|
||||||
27 7709 Leighcan family, warm - Rock outcrop complex, extremely stony. |
|
||||||
28 7710 Leighcan family - Rock outcrop complex, extremely stony. |
|
||||||
29 7745 Como - Legault families complex, extremely stony. |
|
||||||
30 7746 Como family - Rock land - Legault family complex, extremely stony. |
|
||||||
31 7755 Leighcan - Catamount families complex, extremely stony. |
|
||||||
32 7756 Catamount family - Rock outcrop - Leighcan family complex, extremely stony. |
|
||||||
33 7757 Leighcan - Catamount families - Rock outcrop complex, extremely stony. |
|
||||||
34 7790 Cryorthents - Rock land complex, extremely stony. |
|
||||||
35 8703 Cryumbrepts - Rock outcrop - Cryaquepts complex. |
|
||||||
36 8707 Bross family - Rock land - Cryumbrepts complex, extremely stony. |
|
||||||
37 8708 Rock outcrop - Cryumbrepts - Cryorthents complex, extremely stony. |
|
||||||
38 8771 Leighcan - Moran families - Cryaquolls complex, extremely stony. |
|
||||||
39 8772 Moran family - Cryorthents - Leighcan family complex, extremely stony. |
|
||||||
40 8776 Moran family - Cryorthents - Rock land complex, extremely stony. |
|
||||||
|
|
||||||
Note: First digit: climatic zone Second digit: geologic zones |
|
||||||
1. lower montane dry 1. alluvium |
|
||||||
2. lower montane 2. glacial |
|
||||||
3. montane dry 3. shale |
|
||||||
4. montane 4. sandstone |
|
||||||
5. montane dry and montane 5. mixed sedimentary |
|
||||||
6. montane and subalpine 6. unspecified in the USFS ELU Survey |
|
||||||
7. subalpine 7. igneous and metamorphic |
|
||||||
8. alpine 8. volcanic |
|
||||||
|
|
||||||
The third and fourth ELU digits are unique to the mapping unit |
|
||||||
and have no special meaning to the climatic or geologic zones. |
|
||||||
|
|
||||||
Forest Cover Type Classes: 1 -- Spruce/Fir |
|
||||||
2 -- Lodgepole Pine |
|
||||||
3 -- Ponderosa Pine |
|
||||||
4 -- Cottonwood/Willow |
|
||||||
5 -- Aspen |
|
||||||
6 -- Douglas-fir |
|
||||||
7 -- Krummholz |
|
||||||
|
|
||||||
|
|
||||||
8. Basic Summary Statistics for quantitative variables only |
|
||||||
(whole dataset -- thanks to Phil Rennert for the summary values): |
|
||||||
|
|
||||||
Name Units Mean Std Dev |
|
||||||
Elevation meters 2959.36 279.98 |
|
||||||
Aspect azimuth 155.65 111.91 |
|
||||||
Slope degrees 14.10 7.49 |
|
||||||
Horizontal_Distance_To_Hydrology meters 269.43 212.55 |
|
||||||
Vertical_Distance_To_Hydrology meters 46.42 58.30 |
|
||||||
Horizontal_Distance_To_Roadways meters 2350.15 1559.25 |
|
||||||
Hillshade_9am 0 to 255 index 212.15 26.77 |
|
||||||
Hillshade_Noon 0 to 255 index 223.32 19.77 |
|
||||||
Hillshade_3pm 0 to 255 index 142.53 38.27 |
|
||||||
Horizontal_Distance_To_Fire_Points meters 1980.29 1324.19 |
|
||||||
|
|
||||||
|
|
||||||
9. Missing Attribute Values: None. |
|
||||||
|
|
||||||
|
|
||||||
10. Class distribution: |
|
||||||
|
|
||||||
Number of records of Spruce-Fir: 211840 |
|
||||||
Number of records of Lodgepole Pine: 283301 |
|
||||||
Number of records of Ponderosa Pine: 35754 |
|
||||||
Number of records of Cottonwood/Willow: 2747 |
|
||||||
Number of records of Aspen: 9493 |
|
||||||
Number of records of Douglas-fir: 17367 |
|
||||||
Number of records of Krummholz: 20510 |
|
||||||
Number of records of other: 0 |
|
||||||
|
|
||||||
Total records: 581012 |
|
||||||
|
|
||||||
===================================================================== |
|
||||||
Jock A. Blackard |
|
||||||
08/28/1998 -- original text |
|
||||||
12/07/1999 -- updated mailing address, citations, background info |
|
||||||
for study area, added summary statistics. |
|
||||||
===================================================================== |
|
||||||
|
|
File diff suppressed because it is too large
diff.load
Loading…
Reference in new issue