60 likes | 197 Vues
The VARS Data Mining Project introduces the innovative Snapshot Tool, designed to facilitate data analysis and accessibility for researchers. This tool provides efficient mechanisms for searching and downloading data sets. It targets scientists and annotators needing to access embargoed data and work with correlated ROV navigation information. The installation requires Java 1.5, at least 512MB RAM, and sufficient disk space. Future improvements include better interop with other VDM software and integration with Microsoft Research to enhance machine learning capabilities and minimize bias in annotation processes.
E N D
VARS Data Mining An introduction to the snapshot tool
Overview • What is the VARS Data Mining Project? • What does this “snapshot tool” do? • Who is it for?
Installation • Requirements • Java 1.5 • 512MB RAM (1GB ?) • Disk Space • How to handle .tar.gz • Where to install
Future Improvements • Scheme to allow MBARI staff to access embargoed data • ROV Navigation correlated to current search set • Alternate search parameters? Scientist, annotator, season, etc… • Interop with VARS, Interop with other VDM software. • Search set download mechanism
Future of Mining VARS • Possible collaboration with Microsoft Research. • Begin work on machine learning oriented data mining. • Easy access to data within “R” • Look for biases in our current annotation process.