Challenges in building and using a Lifetime Personal Information Store based on MyLifeBits
Gordon Bell
Accelerating Change, 6 November 2004
The 1 TB Life
1 TB gives you 65+ years of:
• 100 email messages a day (5 KB each)
• 100 web pages a day (50 KB each)
• 5 scanned pages a day (100 KB each)
• 1 book every 10 days (1 MB each)
• 10 photos per day (400 KB JPEG each)
• 8 hours per day of sound, e.g. telephone, voice annotations, and meeting recordings (8 Kb/s)
• 1 new music CD every 10 days (45 min each at 128 Kb/s)
It will take you 5 years to fill up your 80 GB drive.
Want video? Buy more cheap drives (1 TB/year lets you record 4 hours/day of 1.5 Mb/s video).
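The storage budget on this slide can be checked with simple arithmetic. The sketch below uses only the slide's figures; it assumes decimal units (1 KB = 1000 bytes, 1 TB = 10^12 bytes) and 365-day years, neither of which the slide states, and all names are illustrative.

```python
# Back-of-the-envelope check of the "1 TB Life" figures.
# Assumptions (not stated on the slide): decimal units and 365-day years.
KB, MB, GB, TB = 10**3, 10**6, 10**9, 10**12


def stream_bytes(hours, kilobits_per_second):
    """Bytes produced by a constant-bit-rate stream of the given duration."""
    return hours * 3600 * kilobits_per_second * 1000 / 8


# Bytes per day for each item listed on the slide.
daily = {
    "email": 100 * 5 * KB,
    "web pages": 100 * 50 * KB,
    "scanned pages": 5 * 100 * KB,
    "books": 1 * MB / 10,                       # one 1 MB book every 10 days
    "photos": 10 * 400 * KB,
    "sound": stream_bytes(8, 8),                # 8 hours at 8 Kb/s
    "music CDs": stream_bytes(0.75, 128) / 10,  # one 45 min CD every 10 days
}

bytes_per_day = sum(daily.values())
bytes_per_year = bytes_per_day * 365
years_per_tb = TB / bytes_per_year
years_per_80gb = 80 * GB / bytes_per_year
video_bytes_per_year = stream_bytes(4, 1500) * 365  # 4 hours/day at 1.5 Mb/s

print(f"{bytes_per_day / MB:.1f} MB/day, {bytes_per_year / GB:.1f} GB/year")
print(f"1 TB lasts {years_per_tb:.1f} years; 80 GB lasts {years_per_80gb:.1f} years")
print(f"Video: {video_bytes_per_year / GB:.1f} GB/year")
```

Under these assumptions the total is about 43 MB/day, or roughly 16 GB/year. That gives about 63 years per decimal terabyte (nearer 70 with a binary terabyte), in line with the slide's rounded "65+ years", and about 5 years for an 80 GB drive. Sound recording dominates the non-video budget.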
Everything goes in a database
• You need all the features of a database (consistency, indexing, pivoting, queries, speed/scalability, backup, replication)
• If you don’t use one, you will find yourself creating one!
• Files as blobs, also sync with file system for legacy apps
SQL
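As a minimal sketch of "files as blobs" with indexing and queries, the example below stores items in a single table. It uses Python's built-in `sqlite3` module with an in-memory database purely as a stand-in for the SQL store; the table, column, and function names and the sample rows are hypothetical, not the MyLifeBits schema.

```python
import sqlite3

# In-memory SQLite stands in for the SQL database named on the slide.
# Table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE items (
        id      INTEGER PRIMARY KEY,
        kind    TEXT NOT NULL,   -- e.g. 'email', 'photo', 'scan'
        title   TEXT,
        created TEXT NOT NULL,   -- ISO 8601 timestamp
        content BLOB             -- the file itself, stored as a blob
    )
""")
# Indexes keep lookups by type and by time cheap as the store grows.
conn.execute("CREATE INDEX idx_items_kind ON items(kind)")
conn.execute("CREATE INDEX idx_items_created ON items(created)")


def add_item(kind, title, created, content):
    """Insert one item and return its database-assigned id."""
    cur = conn.execute(
        "INSERT INTO items (kind, title, created, content) VALUES (?, ?, ?, ?)",
        (kind, title, created, content),
    )
    conn.commit()
    return cur.lastrowid


def items_between(start, end, kind=None):
    """Items created in [start, end), optionally restricted to one kind."""
    sql = "SELECT id, kind, title FROM items WHERE created >= ? AND created < ?"
    params = [start, end]
    if kind is not None:
        sql += " AND kind = ?"
        params.append(kind)
    return conn.execute(sql + " ORDER BY created", params).fetchall()


# Made-up sample data.
photo_id = add_item("photo", "Sample photo", "2004-11-06T10:00:00", b"\xff\xd8 sample bytes")
add_item("email", "Sample email", "2004-11-07T09:00:00", b"Subject: sample")

print(items_between("2004-01-01", "2005-01-01", kind="photo"))
```

Keeping the bytes in the same row as the metadata means backup and replication cover both together; syncing blobs back to the file system for legacy applications is outside the scope of this sketch.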
MyLifeBits Software
[Diagram: capture, import, and viewing tools around the MyLifeBits Shell and the MyLifeBits store (database)]
• SenseCam
• Telephone capture tool
• PocketPC transfer tool
• PocketRadio player
• TV capture tool
• GPS import & Map display
• Radio capture & EPG
• TV EPG download tool
• MAPI interface
• Legacy email client
• Browser tool
• Internet files
• Screen saver
• Legacy applications
• IM capture
• Voice annotation tool
• Text annotation tool
• Import files
Memex
“As We May Think”, Vannevar Bush, 1945
“A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility”
• Full-text search, text & audio annotations, and hyperlinks
The guinea pig
• Gordon Bell is digitizing his life
• Has now scanned virtually all:
    • Books written (and read when possible)
    • Personal documents (correspondence including memos and email, bills, legal documents, papers written, …)
    • Photos
    • Posters, paintings, photos of things (artifacts, … medals, plaques)
    • Home movies and videos
    • CD collection
    • And, of course, all PC files
• Now recording: phone, radio, TV (movies), web pages… conversations and meetings to come
• Paperless throughout 2002. 12” scanned, 12’ discarded.
• Only 44 GB, incl. 10 wma, 14 SQL!!! Video: o(100) + 500 mov
Personal LifeLog Applications
[Figure: applications arranged by “Application used by” (Self, Others) and “Application controlled by” (Self, Others)]
Applications shown: Diary/Journal, Tutor, Mentor, Advisor, Meeting Prep, Personal Assistant, Babysitter, Financial Manager, Companion, Caretaker, Medical Manager, Parole Officer, Photo Album, Assistant for Elderly, Personal Flight Recorder, Autobiography, Captain’s Log, Conservator, Biography, Baby Book, Trustee, Obituary, Personal Proxy, Executor
Personal Search is not Professional or Web search
• System sees every entry & access
• Everything, not just a professional life
• Limited to SIS, not an infinite amount, covers a profession & personal life
[Figure: Depth (e.g. information item types & coverage) versus Knowledge breadth (e.g. Dewey classification), comparing a Professional user, MyLifeBits, and the Web as seen by search engines]
Why bother? …some reasons
• Technologist: “we can”; an opportunity, e.g. 1 TB disks
• For all of us with new media: a need, e.g. JPG, MP3
• Environmentalist: eliminates “atoms” (paper, CDs…)
• For business: memory enhancement & faster search. Let content analysis and data mining discover trends and correlations in our lives… that even we don’t know.
• Business: It costs more to delete than it costs to store
• Preservationist: it decays or disappears unless it’s saved
• For the human pack rat: “I may need it some day.”
• For posterity and nostalgia: “Maybe others will want it.”
• Stories and ambience: basis for creating content
• For the aging & failed memory: surrogate memory
So you’ve got it – now what do you do with it?
“A record if it is to be useful … must be continuously extended, it must be stored, and above all it must be consulted”
“The difficulty seems to be, not so much that we publish unduly … but rather that publication has been extended far beyond our present ability to make real use of the record”
- Vannevar Bush
Using my life bits: beyond folders
#1: Folders
One item. One place. It worked for 1000s of years.
[Figure: folder hierarchy under “My docs and archive”. Labels shown: Active Employer; X-Employer; Self; Project; Library/file cab (repeated under each branch); Business: Invests, family $s, & Legal; Personal, including Medical; <1995]
Freedom from hierarchy
• c:\my documents\talks\MyLifeBits.ppt
  ID = location = organization = display string
• Don’t make me invent unique names
• Don’t make me file everything
• Or let me pick multiple folders
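One way to separate identity from location, organization, and display string is to let the system assign IDs and treat folders as optional, overlapping collections. The sketch below is a hypothetical in-memory model of that idea, not the MyLifeBits implementation; every name in it is illustrative.

```python
from collections import defaultdict
from itertools import count

# Hypothetical model: the system assigns IDs, display strings need not be
# unique, and a collection is a set of IDs rather than a place on disk.
_next_id = count(1)
items = {}                      # item id -> display string
collections = defaultdict(set)  # collection name -> set of item ids


def add_item(display, *collection_names):
    """Store an item under zero or more collections and return its id."""
    item_id = next(_next_id)
    items[item_id] = display
    for name in collection_names:
        collections[name].add(item_id)
    return item_id


def collections_of(item_id):
    """Names of every collection that contains the item."""
    return sorted(name for name, ids in collections.items() if item_id in ids)


talk = add_item("MyLifeBits.ppt", "talks", "MyLifeBits")  # filed in two places
draft = add_item("MyLifeBits.ppt")                        # same name, not filed

print(collections_of(talk))   # ['MyLifeBits', 'talks']
print(collections_of(draft))  # []
```

Because the ID carries identity, two items can share a display string, an item can sit in several collections at once, and an unfiled item is still stored and retrievable by other means.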
Using my life bits: easily adding valuable content
#2: Text annotations
Making bits more valuable and retrievable.
Getting the user to tell a story is the ultimate in media value
• A story is a “layout” in time and space
• Most valuable content (by selection, and by being well annotated)
• Stories must include links to any media they use (for future navigation/search – “transclusion”).
• Cf: MovieMaker; Creative Memories PhotoAlbums
[Example photo annotations:]
We took him to lunch at our favorite Dim Sum place to say farewell
Dapeng was an intern at BARC for the summer of 2000
At table L-R: Dapeng, Gordon, Tom, Jim, Don, Vicky, Patrick, Jim
Annotation like this… Voice Annotation
Annotation when you feel like it, how you feel like it
• Screensaver is the killer app!
Using my life bits: the value of time & time posts
#3: “I remember when…”
The 1st or 2nd most important retrieval handle.
[Figure: Mark Stewart’s Lifeline (M Stewart Lifeline v2). Copyright Mark Stewart, 2004]
[Figure: lifeline chart with parallel tracks. Family (F:) tracks: father, mother, self, sister, spouse, son, daughter, grandchildren, significant other. Other tracks: Education, W/Education, Work, Organization]
Using my life bits: Where, an essential attribute
#4: I remember where
Just essential.
Using my life bits: pivoting on data to aid recall
#5: Relationships (links)
Using something near “it” to find “it”.
MyLifeBits Entities & Links
[Figure: entities joined by typed links. Link examples shown: Photo of Event; Caller in Phone Call; Annotates; Transcludes]
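Typed links are what make pivoting possible: starting from something you do remember, you follow links to what is near it. The sketch below is a hypothetical model that borrows the four link types named on the slide; the entity names, the link directions, and the functions are assumptions made for illustration.

```python
# Hypothetical model: entities are plain strings, links are typed triples.
links = []  # (source, link_type, target)


def link(source, link_type, target):
    """Record a typed link from source to target."""
    links.append((source, link_type, target))


def near(entity):
    """Everything one link away from entity, in either direction."""
    found = set()
    for source, link_type, target in links:
        if source == entity:
            found.add((link_type, target))
        elif target == entity:
            found.add((link_type, source))
    return sorted(found)


# Made-up sample data using the slide's link types.
link("photo-17", "photo of", "event: farewell lunch")
link("note-3", "annotates", "photo-17")
link("story-1", "transcludes", "photo-17")
link("person: caller", "caller in", "phone call-42")

# Pivot: remember the event, find the photo, then find what surrounds it.
print(near("event: farewell lunch"))
print(near("photo-17"))
```

Starting from the event yields the photo, and starting from the photo yields its annotation, its event, and the story that transcludes it. That is "using something near it to find it" in two hops.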
Using my life bits: never enough meta-data …but, can you afford it?
#6: more meta-data (properties)
I remember something about the content (understanding a person’s work)
Using my life bits: classification of everything
#7: classification
Is any gain from non-automated classification worth the cost and pain?
Is traditional classification required?
…at OCLC there was unanimous agreement among faculty and participants that “access to electronic resources requires controlled vocabulary and classification”
OCLC Institute, “Knowledge Access Management: Tools and Concepts for Next Generation Catalogers”, 17-19 November 1997, Dublin, Ohio.
“I have watched as hundreds of millions of dollars have been invested to re-invent the wheel - often badly.” -Marcia Bates
Professional Life: Organizations, Administrivia, Projects, Library
[Figure: Lederberg papers, official reports. Axis: Number of document segments]
Lederberg Artifact types
Abstracts, Agendas not, Announcements m, Application forms, Articles m, Autobiographies m, Bibliographies m, Biographies m, Brochures m, Certificates m, Correspondence m, Diaries m, Drafts (documents), Drawings m, Electronic images m, Essays m, Eulogies, Excerpts, Grant proposals, Interviews m, Invitations, Laboratory notebooks m, Laboratory notes, Lecture notes, Lectures m, Legal documents m, Legislative records, Lists, Manifestoes, Memoirs m, Minutes, Monographs m, Narratives, Newsletters, Newspaper columns m, Notebooks m, Notes, Obituaries, Official reports, Oral histories m, Petitions, Photographic prints m, Press releases m, Procedures, Proceedings m, Programs m, Proposals m, Questionnaires, Reminiscences, Reports m, Resolutions, Resumes, Reviews m, School records, Speeches m, Summaries, Tables (documents), Technical reports m, Transcripts m, Typescripts, Video recordings m