1 / 43

Georeferencing: Collaboration to Automation in Biodiversity Data Management

Learn about georeferencing, collaboration, and automation in managing biodiversity data. Understand georeference terms, data quality, and acceptable georeference standards. Discover methods for precise georeferencing and data quality enhancement.

tspotts
Download Presentation

Georeferencing: Collaboration to Automation in Biodiversity Data Management

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Georeferencing Introduction: Collaboration to Automation Michelle Koo, Carol Spencer, David Bloom, Nelson Rios Museum of Vertebrate Zoology (UC Berkeley), VertNet, & Tulane University

  2. 1. Georeferencing 2. Collaborations 3. Automation

  3. What is a georeference? A numerical description in a coordinate system of a place.

  4. What we have:Localities we can read ID Species Locality 1 Lynx rufus Dawson Rd. N Whitehorse 2 Pudu puda cerca de Valdivia Canis lupus 3 20 mi NW Duluth 4 Felis concolor Pichi Trafúl 5 Lama alpaca near Cuzco 6 Panthera leo San Diego Zoo 7 Sorex lyelli Lyell Canyon, Yosemite 8 Orcinus orca 1 mi W San Juan Island 9 Ursus arctos Bear Flat, Haines Junction

  5. What we want:Localities we can map

  6. Darwin Core • Metadata standards initially built off of Dublin Core Metadata Standards • Collections of any kind of biological objects or data. • Terminology associated with biological collection data. • Striving for compatibility with other biodiversity-related standards. • Facilitating the addition of components and attributes of biological data.

  7. Darwin Core Location Terms • HigherGeography • waterbody, island, islandGroup • continent, country, countryCode, stateProvince, county, municipality • locality • minimumElevationInMeters, maximumElevationInMeters, minimumDepthInMeters, maximumDepthInMeters

  8. Darwin Core Georeference Terms • decimalLatitude, decimalLongitude • geodeticDatum • coordinateUncertaintyInMeters • coordinatePrecision • pointRadiusSpatialFit • footprintWKT, footprintSRS, footprintSpatialFit • georeferencedBy, georeferenceProtocol • georeferenceSources • georeferenceVerificationStatus • georeferenceRemarks

  9. What is a georeference? A numerical description of a place that can be mapped.

  10. “Davis, Yolo County, California” Coordinates: 38.5463 -121.7425 Horizontal Geodetic Datum: NAD27 Coordinates: 38.5463 -121.7425 Horizontal Geodetic Datum: NAD27 “point method”

  11. Data Quality • data have the potential to be used in ways unforeseen when collected. • the value of the data is directly related to the fitness for a variety of uses. • “as data become more accessible many more uses become apparent” – Chapman 2005 • the MaNIS/HerpNET/ORNIS guidelines follow best practices (Chapman and Wieczorek 2006) to enhance data quality and value

  12. What is an acceptable georeference? A numerical description of a place that can be mappedand that describes the spatial extent of a locality and its associated uncertainties.

  13. “Davis, Yolo County, California” Coordinates: 38.5486 -121.7542 38.545 -121.7394 Horizontal Geodetic Datum: NAD27 Coordinates: 38.5486 -121.7542 38.545 -121.7394 Horizontal Geodetic Datum: NAD27 “bounding-box method”

  14. “Davis, Yolo County, California” Coordinates: 38.5468 -121.7469 Horizontal Geodetic Datum: NAD27 Maximum Uncertainty: 8325 m Coordinates: 38.5468 -121.7469 Horizontal Geodetic Datum: NAD27 Maximum Uncertainty: 8325 m “point-radius method”

  15. What is an ideal georeference? A numerical description of a place that can be mappedand that describes the spatial extent of a locality and its associated uncertaintiesas well as all possibilities.

  16. “Davis, Yolo County, California” “shape method”

  17. “20 mi E Hayfork, California” “probability method”

  18. Method Comparison easy to produce no data quality point simple spatial queries difficult quality assessment bounding-box easy quality assessment difficult spatial queries point-radius accurate representation complex, uniform shape accurate representation complex, non-uniform probability

  19. Meridians of Graticular Parallels of Latitude Longitude Network Georeferencing Using MaNIS/HerpNET/ORNIS Guidelines

  20. MaNIS/HerpNET/ORNIS (MHO) Guidelines http://manisnet.org/GeorefGuide.html • uses point-radius representation of georeferences • circle encompasses all sources of uncertainty about the location • methodology formalizes assumptions, algorithms, and documentation standards that promote reproducible results • methods are universally applicable

  21. MHO Guidelines • Think of georeferencing as “many-stepped process” – MHO projects produced a first pass. Then validation and refinement should be done using itineries, field notes, collector verification and by mapping the localities and making these maps available on-line.

  22. Data Quality • “Fitness of use” of the data • As a collector, you may have an intended use for the data you collect but data have the potential to be used in unforeseen ways…. The value of your data is directly related to the fitness for a variety of uses. • As data become more accessible many more uses become apparent. – Chapman 2005, Chapman and Wieczorek 2006 • We are using the MHO methods as a tool to enhance data quality

  23. Maximum Error Distance from Uncertainties: • Uncertainty is a “measure of the incompleteness of one’s knowledge or information about an unknown quantity whose true value can be established if a perfect measure device were available.” (Cullen & Frey 1999) • In MHO Guidelines, this is defined as the numerical value for the upper limit of the distance from the coordinates of a locality to the outer extremity of the area within which the whole of the described locality must lie (i.e., what can be mistaken for that locality based on the description given).

  24. Extents: • Extent- the geographic range, magnitude or distance that a location may actually represent. (With a town, the extent is the polygon that encompasses the area inside the town’s boundaries.) • Linear extent- what we use for the Point-Radius Method. Defined as the distance from the geographic center of the location to the furthest point of the geographic extent of the location.

  25. Precision and Accuracy: • Always use as many decimal places as given by the coordinate source. • A measurement in decimal degrees given to five decimal places is more precise than a measurement in degrees minutes seconds. • False precision will result if data are recorded with a greater number of decimal points (e.g., when converting from DMS to decimal degrees). • Always record the accuracy of your GPS readings (how well the GPS measures the true value of the location). The accuracy is given at the same time as the coordinate, but usually will not be recorded with the coordinates when you output them on recreational GPS units. Otherwise, default accuracy is assumed 30 m, so stating your accuracy is better.

  26. Geographical Concepts: • Geodetic Datum:defines the position of the origin, scale, shape, and orientation of a 3-dimensional model of the earth. Example: WGS84. • Coordinate System: defines the “units of measure” of position with respect to the datum. Example: latitude, longitude in degrees, minutes, seconds

  27. Map Projections: • Mathematical transformations of the 3-D model of the surface of the earth onto a 2-D map. • Many different kinds (e.g., conical, cylindrical, azimuthal) – all are compromises in distortions (either area, shape, distance, or direction), but some preserve areas or distances. • When measuring distances on paper maps, use an equal distance projection, if available, otherwise understand the implications.

  28. Georeferencing Concepts • Named place: a place of reference in a locality description. Example: “Davis” in “5 mi N of Davis” • Areal extent:the geographic area covered by a named place (feature). Example: the area inside the boundaries of a town. • Linear extent:the distance from the geographic center to the furthest point of the areal extent of a named place.

  29. Georeferencing Concepts • Offset: the distance from a named place. Example: “5 mi” in “5 mi NE of Beatty”. • Heading: the direction from a named place. Example: “NE” in “5 mi NE of Beatty”.

  30. Georeferencing Concepts • coordinateUncertaintyInMeters:“The horizontal distance (in meters) from the given decimalLatitude and decimalLongitude describing the smallest circle containing the whole of the Location. Leave the value empty if the uncertainty is unknown, cannot be estimated, or is not applicable (i.e., there are no coordinates). Zero is not a valid value for this term.” (from Darwin Core) • Maximum Error Distance: same as coordinateUncertaintyInMeters, except the units are the same as in the locality description, not necessarily meters.

  31. Scale Uncertainty (ft) Uncertainty (m) 1:1,200 3.3 ft 1.0 m 1:2,400 6.7 ft 2.0 m 1:4,800 13.3 ft  4.1 m 1:10,000 27.8 ft 8.5 m 1:12,000 33.3 ft 10.2 m 1:24,000 40.0 ft  12.2 m 1:25,000 41.8 ft 12.8 m 1:63,360 106 ft 32.2 m  1:100,000 167 ft 50.9 m 1:250,000 417 ft 127 m Sources of uncertainty: 20° 30’ N 112° 36’ W • Coordinate Uncertainty • Map scale • The extent of the locality • GPS accuracy • Unknown datum • Imprecision in direction measurements • Imprecision in distance measurements (e.g., 1 km vs. 1.1 km)

  32. 1. Georeferencing 2. Collaborations 3. Automation

  33. Collaborative Distributed Database Portals for Vertebrates

  34. Collaborations

  35. MaNIS Localities Georeferenced n = 326k localities (1.4M specimens) r = 14 localities/hr (point-radius method) t = 3 yrs (~40 georeferencers)

  36. ORNIS Localities Georeferenced n = 267k localities (1.4M specimens) r = 30 localities/hr (point-radius method) t = 2 yrs (~30 georeferencers)

  37. HERPNET Localities Georeferenced n = 646 k localities (1.8 M specimens) r = 15 localities/hr (point-radius method) t =5 yrs (111 georeferencers)

  38. Scope of the Problem for Natural History Collections ~2.5 Billion (109) ~6 records per locality* ~14 (30) localities per hour* ~15,500 (7,233) years * based on the MaNIS (ORNIS) Project

  39. The Collaboration continues…

  40. 1. Georeferencing 2. Collaborations 3. Automation

  41. Automation Tools for Georeferencing BioGeomancer Classic GeoLocate Georeferencing Calculator DIVA-GIS

  42. http://www.biogeomancer.org

More Related