1 / 28

DMT 2011 Week 2

DMT 2011 Week 2. Leiden University. The university to discover. Adriaan van der Weel. Leiden University. The university to discover. The digital lifecycle. Leiden University. The university to discover. DMT 2011 Project. Erven F. Bohn correspondence Creation: Transcription Proofing

mayes
Télécharger la présentation

DMT 2011 Week 2

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. DMT 2011Week 2 Leiden University. The university to discover. • Adriaan van der Weel

  2. Leiden University. The university to discover. The digital lifecycle

  3. Leiden University. The university to discover. DMT 2011 Project • Erven F. Bohn correspondence • Creation: • Transcription • Proofing • ‘Enrichment’: encoding in TEI XML • Storage • Retrieval • Publication

  4. Leiden University. The university to discover. Recapitulation

  5. Leiden University. The university to discover. The computer as a medium 1 • From the book to digital ways of ordering information • Media as knowledge machines • Characteristics of media determine the nature of the knowledge machine

  6. Leiden University. The university to discover. The computer as a medium 2 • Publishing: • Conventional, involving text like books, journals, newspapers, etc. • Cultural heritage, involving any modality (but BDMS limits itself to text and images) • Digital born vs digitised materials • Profit vs non-profit

  7. Leiden University. The university to discover. More than a medium • The computer is • A medium (replacement for printed textual transmission) • A Universal Machine • As a Universal Machine the computer is the vehicle for ‘humanities computing’ • No clear division

  8. Leiden University. The university to discover. Digital ‘knowledge machine’ • More than a digital ‘book’: • Making text intelligent (main focus) • Using intelligent agents to mine text • People (Web 2.0) • Need markup: • Documents • Rules • Styles • Markup application

  9. Leiden University. The university to discover. XML basics • 1. Application • Document instances • Validation (DTD, Schema) • Styles/transformations (Week 5-7) • 2. The markup language • Elements • Attributes • Entities • ASCII and Unicode (next week)

  10. Leiden University. The university to discover. The markup application

  11. Leiden University. The university to discover. Today: The Book Trade Correspondence Project

  12. Leiden University. The university to discover. The Bohn publishing company • Bohn founded Haarlem, 1752 • By Christoph Henrich Bohn from Lübeck (Germany) • Son: François Bohn (1751-1819) • 1875 no more bookselling • Up to 1900: 55% literature, philology, music, arts, history, geography, travel;10% school books; 35% general, children’s, theology, science and social science • After 1900: [chiefly science and professional]

  13. Leiden University. The university to discover. The Bohn archive • Provenance: Erven F. Bohn [publishers], Haarlem • Ca 1973, through Ernst Braches, keeper of rare books of the University Library • Was using the archive for PhD research on ‘De nieuwe kunst’ • Finding aid (EAD)

  14. Leiden University. The university to discover. The Bohn archive -2- • General documentation • Financial • Correspondence • Editorial and title production • Litigation • Etc.

  15. Leiden University. The university to discover. Correspondence (incoming)

  16. Leiden University. The university to discover. Transcription • Dear Sirs, • I will accept / • £10 for the / • rights to make a / • translation into / • Dutch of my / • novel entitled / • Wanda // • Printers will / • send you entire / • proofs from London / • instantly. Please / • to send money / • on receipt of this / • Address Madame / • Ouida. ~c. 2 words illegible~/ • ~c. 1 word illegible~ Ouida / • L. de la Ramée

  17. Leiden University. The university to discover. Correspondence (outgoing) • An explanation of how letters in were copied in the nineteenth century

  18. Leiden University. The university to discover. • <div type="letter"> <opener> <dateline> <date when="1920-12-04"> 4 Dec <supplied reason="from preprinted letterhead">19</supplied>20. </date> </dateline> <address> <addrLine> <orgName> B.H. Blackwell Ltd </orgName> </addrLine> <addrLine> Oxford </addrLine> </address> <salute> Dear Sir, </salute> </opener> <p> In reply to your order d.d.22/11 <lb/> we can inform you that <title>Folia Neuro-<lb/ Biologica</title> Band 13 has not yet appeared. </p> <closer> <salute> Yours truly, </salute> <signed>p.o De Erven F. Bohn </signed> <signed> M.K. </signed> </closer> </div>

  19. Leiden University. The university to discover. Seminar • 1. HTML exercise • 2. Using the BDMS server • 3. Document analysis

  20. HTML <!DOCTYPE HTML PUBLIC “-//W3C//DTD HTML 4.01 Transitional//EN” “http://www.w3.org/TR/html4/loose.dtd”><html> <head> <title>[Title of the web page]</title> </head> <body> </body> </html>

  21. <p> for paragraphs • <b> for bold text • <i> for italic text

  22. Hyperlinks: <a href=“http://www.bookandbyte.org” target=“_new” > Click here</a> • Images:<img src=“http://www.flickr.com/photos/library_of_congress/4120044616/”/> • The browser must be able to access the referenced file using the path that you provide.

  23. <table width=“100%”><tr> <td> Row 1, column 1</td> <td>Row 1, column 2</td></tr><tr> <td> Row 2, column 1</td> <td> Row 2, column 2 </td> </tr> </table>

  24. stylesheet.css stylesheet.css body { margin-left: 10%; margin-right: 10%; font-family: Arial; } P { text-indent: 1.5em; } <link type=”text/css” rel=”stylesheet” href=”stylesheet.css”> <link type=”text/css” rel=”stylesheet” href=”stylesheet.css”>

  25. One double quote is not the same as two single quotes! • Be careful with “multiple extentions” in filenames! E.g. dmt.exercise1.html • Tags must be nested properly. • MS Word may convert regular double quotes to “smart quotes”

  26. Leiden University. The university to discover. From typography to markup • Homo typographicus • We are conditioned by books • We live in the ‘Order of the Book’ • Books order and structure: • The ‘libroverse’ contains books • Inside each book • Typography is an implicit structuring and ordering device • Computers need explicit instructions: • Markup: HTML, XML, TEI, etc.

  27. Leiden University. The university to discover. Document analysis • What relevant knowable things exist • How can we classify them • What is their relationship • Different categories of text have different relevant classes of knowable things, e.g.: • Correspondence • Drama • Poetry

  28. Document Analysis Exercise • Weekly programme > Week 2

More Related