Data Mining for the Upper Colorado River Basin Climate Effects Network
240 likes | 410 Vues
Briefing for the Upper Colorado River Basin Pilot Update Meeting, February 24, 2011. Data Mining for the Upper Colorado River Basin Climate Effects Network. Goal (completion summer 2011).
Data Mining for the Upper Colorado River Basin Climate Effects Network
E N D
Presentation Transcript
Briefing for the Upper Colorado River Basin Pilot Update Meeting, February 24, 2011 Data Mining for the Upper Colorado River Basin Climate Effects Network
Goal (completion summer 2011) • Provide a “portal” to information about people, projects, publications, and data important to UCRB climate effects research • Add information to a framework that can be sustained by scientists and data managers over the long term • Include direct participation and data from science and resource management partners
Methods • Use USGS ScienceBase and related projects already under development • Conduct data mining, information cleanup, and make contacts with scientists and other data providers • Solicit input from stakeholders to refine the system, produce a better tool, and expand capabilities (visualization)
UCRB ScienceBase Catalog Spatially located UCRB resources UCRB Footprint UCRB resources with links to more info (access, abstract)
Projects – Status • USGS BASIS+ projects, tasks, or subtasks identified as potential candidates • Over 418 reviewed and connected to UCRB in ScienceBase • Work ongoing to identify sources of other agency projects • Working on process for review and approval of “public project records” in ScienceBase
Publications – ScienceBase citation • UCRB citations harvested • into ScienceBase Catalog • abstract • authors • access methods • original metadata • Resources cataloged • in centralized locale • Refine information • (add spatial footprints) • Connect information • (e.g., see all records with • keyword “Utah”)
Publications – Status • 3,420 journal and other external citations cataloged and quality checked in Mendeley • Periodically, citations are being harvested from Mendeley and integrated with ScienceBase • 546 USGS Numbered Series reports cataloged in ScienceBase • USGS projects reviewed for publication products with items cataloged
People Contact Information
People See all resources credited to person of interest
People – Status • Over 100 USGS researchers with UCRB history identified and referenced through ScienceBase • Over 1,000 authors identified through publications record • Working on possibilities and best options for personal profiles and connections
Data • Add information to Catalog • Describe information (metadata) • Share and archive
Data Describe data using comprehensive fields: what, where, from, who, when
Data – Status • Major USGS and other corporate data assets identified and cataloged in ScienceBase • Project datasets under review from BASIS+ record • ScienceBase Uploader will be used to load these into a UCRB Project GeoDatabase • Data mining and identification through partner contacts
Addressing Science Questions • Developing ScienceBase to use data, maps, visualizations, and analytical tools to address science questions • UCRB question we are currently pursuing: How will drought/land use affect dust production and deposition, and how will this affect water supplies?
Addressing Science Questions • Requirements to address question: • Cataloged and available data and information • Visualization tools • Online mapper for quick views • Advanced GIS capabilities for geospatial analysts • Derivative product generation and packaging • Data handling capabilities • Complex datasets (MODIS, ArcMap (.mxd), databases) • Data services (GIS services, THREDDS) • Data sharing, archiving, documenting
Addressing Science Questions • Task: develop a dust source to deposition map at different scales (plot to regional across time) • Source data: soil type, geology, vegetation, land use, imagery, wind patterns • Deposition data: topography, wind patterns • Study how water supplies are affected during years of low to high dust deposition
Addressing Science Questions Geology Imagery Soils Land Cover Snowpack
Visualization Potential product
Next Steps • Acquire and integrate disparate datasets • Develop capabilities to catalog and display complex data • Develop Spatial Toolkit – online mapper • Visualize and lay out information • Establish visualization options (transparency) • Produce, share, catalog derivative data products • Plan for maintenance and sustainability
Questions and Input • http://ucrb.sciencebase.gov • Natalie Latysh (nlatysh@usgs.gov) • or Sky Bristol (sbristol@usgs.gov)