|
||||
| Earth Resources Observation and Science (EROS) |
As with any large data set, the Digital Chart of the World (DCW) has some errors resulting from the map digitizing and subsequent processing. The problem then becomes how to detect these errors, and possibly correct them, as functions like TOPOGRID will not give desired results with incorrect input. Fortunately, the DCW hypsography layers have a systematic labeling of the data, allowing for a systematic solution.
Errors within the hypsography layers are detected using ARC/INFO's vector and raster tools. Contour errors are detected first. To detect the contour errors, the elevation difference between an arc and its neighbor is checked to determine if it is within a specified range. ARC/INFO's raster tool EUCALLOCATION, which forms a polygon zone for each arc, allowing the neighbor of each arc to be found and calculate the elevation difference. Arcs not within the specified range are flagged as an error. A similar approach is then applied for the points. The contours are also used with the points, however, the actual values between the two neighboring contours are needed. Here, the boundaries formed by EUCALLOCATION, are expanded back to the original arc's location with ARC/INFO's COSTALLOCATION function. Points with an elevation value outside the elevation range of the contour are flagged as an error.
The only case in which data point correction can be automated is when the point data have been generated based on another layer. The DCW supplemental point hypsography layer represents locations and values of collapsed contours. Because they are collapsed contours, their elevations will be based on the surrounding contours. Here, contour correction would be much more difficult, and less certain, because more data than just an arc's neighbor is needed.
Data errors are flagged and, where possible, corrected after processing. Using the data to detect/correct itself help keep problems (associated with incorporating external ancillary data) like registration, and projections from complicating the situation.
Two major limitations are present in this solution. First, raster processing cannot represent vector data exactly. Second, the raster functions used are slow, taking several hours for results on an average DCW 5 degree tile.
I nformation for coastlines come from PONET - political boundaries. These names coincide with those in the ESRI's ARC/INFO version of DCW.
The contour interval in HYNET is 1,000 feet, although in a few cases the interval is 2,000 feet. HSLINE has contour intervals, when present, at 250 feet intervals, in areas less than 1,000 feet in elevation, and 500 feet intervals when elevations are greater then 1,000 feet in elevation (evenly divisible by 500 but not 1,000).
T his analysis can detect error two, but a preprocessing step would eliminate this error. HSLINE has mislabeled elevations. The errors for HYNET and HSLINE do not occur often, and have no distinctive pattern.
Figure 1 shows an example of a mislabeled contour. The dashed line represents a line that should be labeled 12,000 instead of 13,000, based on the surrounding arcs and the original ONC, (bounding box: -70 -35.15 -69.85 -35, DCW tile HD22) [3].
Of the two point layers, HSPOINT was the only one with any errors. HYPOINT was added for completeness, though cases where points fall outside a specified elevation zone and has a data flag indicating an unknown location are removed. Based on analysis of the HSPOINT errors, there may be a pattern. It appears that sometimes when HSPOINT was given an elevation, its value was based on the elevation of the surrounding HYNET contours, and did not consider the values of any HSLINE contours that where in closer location to the points. An example of mislabeled points is in the Horn Region of Africa, where there are HSLINE contours along the coast at 500 and 750 feet, and between them HSPOINT has values of 1000, which is the first primary contour from HYNET. Point values were also present that were labeled incorrectly, not corresponding to any of the surrounding contours.
Figure 2 shows an example of the mislabeled points. The 1000 foot point elevations to the right of the dashed line should be 500 feet, (bounding box: 38.1 -6.5 39.0 -6, DCW tile QF22) [3].
The described errors need to be addressed for digital elevation model generation (DEM), as part of a project to generate a 30 arc second DEM of the world. The generated DEM will inherit any errors from the source data. Another issue is that some errors are severe enough or, there are just too many errors, to cause the surface generation software to terminate execution prematurely, due to its inability to model a surface over inaccurate data. Generating a surface with these errors will cause some applications, such as hydrologic analysis, to be inaccurate.
The reason error detection can be automated is that the data exist in a defined format. As described earlier, the data exist at 1000 feet intervals, sometimes with 500 foot contours between them. Therefore, the range between contours must be either 1000 feet when there are no supplemental contours present, or 500 feet when supplemental contours are present. In areas below 1000 feet, 250 feet is added to the definition when supplemental contours are present. With this definition, all that is needed is to find out what a neighboring contours value is, and check that value, which must be in the defined range. If the data are not in that range, a possible error has been detected.
Because the supplemental points are collapsed contours, they fall into the same definition as above. The contours are also flagged as increasing or decreasing in value. Therefore if the elevation zone is increasing, any found points within the zone must be the larger value. If the zone is decreasing, any points must be the smaller value.
The goal is to detect, and either correct or remove, the incorrect elevations from the four layers automatically, though some manual processing will be required. Automatic processing was done using tools available in ARC/INFO from both the vector and raster environment. The intent is also to make the process generic enough so that it may be applied to other data sets, and not just for this project.
T he previous twelve steps were performed with ARC/INFO Revision 7.0.2, using a combination of vector and raster routines to achieve the stated objectives.
Information needed for each
contour arc include its left and right neighbor, which
defines a zone between them with a value range. With this
information, ranges for each arc can be determined. There is
no tool in ARC/INFO to do this automatically. As shown in
Figure 3, there is no automatic way for A to know its left
neighbor is B, and its right neighbor is C.
The next approach is to buffer
each arc until it meets its neighbor, which is also being
buffered. This point of contact is a line equidistant between
the two arcs. Refer to Figure 4.
These lines now define polygons for the zone of influence for each contour arc. However, ARC/INFO does not have a vector tool to do this. The vector routine BUFFER will not stop at the equidistant point, it only expands out to a specified distance. ARC/INFO does have a raster tool to do this, EUCALLOCATION. EUCALLOCATION computes for each cell the value of its closest source cell (in Euclidean distance). The above works since A, B, and C all have different, but consistent values, therefore it does not make any difference how many distinct arcs define A, B, or C, since they all should have the same respective value. When expanded using the internal number, the appropriate value can be retrieved directly. Using the internal arc number creates multiple polygons with the same elevation, therefore it is necessary to dissolve these polygons with the same elevation into one polygon. This is necessary for future processing. These boundary arcs can now determine the proper elevation range, and if a contour is out of range, using the original arc number, the appropriate arcs can be flagged.
The supplemental points elevation is determined by looking at the surrounding arcs. See Figure 5. The values of A and B need to be considered to determine the value of K. However, ARC/INFO does not define a tool to find all the arcs surrounding a point, NEAR only finds the closest arc. Finding the closest arc would work when used with the results from the process illustrated in Figure 4, if it were known on which side of A or B, K was found. If it is assumed A is closer, with that data, A could look at its neighboring value on that side to determine the range K should be within. Unfortunately, there is no utility available to determine this in ARC/INFO, plus there is the following case to complicate maters.
In Figure 7, K is on the left of A, with the defined zone being A and B, and K is closest to A. When finding the closest equidistant arc though, the arc between A and C is returned, not A and B, therefore that approach cannot be used.
Figure 8 shows the results of using EUCALLOCATION to expand the arcs back. The problem is that the original arcs are not located equidistant to one another. There is another ARC/INFO raster routine available to stop the expansion at specific locations, COSTALLOCATION. COSTALLOCATION is designed for cost analysis, expanding cells of a grid into no data cells like EUCALLOCATION, but also allowing the expanded values to be affected by a cost grid. This cost grid also allows the process to define areas it cannot expand into by assigning those areas to no data. Setting the cost grid to no data where the original arcs exist, and the value to 1 everywhere else, will allow the equidistant arcs to be expanded back to the original arc locations, defining elevation zone polygons. This will allow the proper value of points to be determined based on the output from COSTALLOCATION.
A definite problem is the inability of raster space to accurately represent vector space. This may cause original contours to be lost when they are rasterized, therefore causing false flagging of errors. To help eliminate this problem, a cell size of 3.75 arc seconds was used, resulting in 960 cells per degree. This eliminated many instances of false flagging, but not all of them.
There is also a problem with the data on the edge of some ONCs where edge matching is not correct. This caused gaps with some contours, causing the zones in step 6 of the algorithm to bleed across their appropriate boundaries. When this occurs, it could result in changing/deleting correct points within the data. Fortunately this is not common.
The process used to determine the point elevation values, and to flag the appropriate arcs is time consuming, especially with the cell size being used. To process a five degree tile requires about a day on a Data General Aviion 410 processor.
Processing could be improved if it could all be vector based instead of using raster tools. This would eliminate the problem of contours being to close to accurately model in a raster environment.
Kevin S. Larson
Consultant
Berger & Co.
2828 Routh Street, Suite 350, Lock Box 17
Dallas, Texas 75201
USA
Telephone: (214) 922-8010
FAX: (214) 922-8020
E-mail: klarson@berger.com
http://www.berger.com(External Web Site)
|
For further assistance, please contact: Customer Services U.S. Geological Survey Center for Earth Resources Observation and Science (EROS) 47914 252nd Street Sioux Falls, SD 57198-0001 Tel: 800-252-4547 Tel: 605-594-6151 Fax: 605-594-6589 Email: custserv@usgs.gov Business Hours: Monday thru Friday, 8:00 a.m. to 4:00 p.m., central time |
| Accessibility FOIA Privacy Policies and Notices | |
![]() |
|