Current work on CitEc
• José Manuel Barrueco Cruz, http://www.uv.es/~barrueco
• Thomas Krichel, http://openlib.org/home/krichel

Data
• Papers from the RePEc dataset
• 31139 Working Papers
• 15145 Journal Articles
• All of them are available online, but not all are free
• More than 90% of them are in PDF or PostScript format

Harvesting
• Perl script that:
  • Reads the RePEc data
  • Downloads the full text of each document
  • Converts it to ASCII (using pstotext)
  • Tries to find a reference section
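
The last harvesting step, locating a reference section in the converted ASCII text, can be sketched as follows. This is a minimal Python illustration, not the authors' Perl script: the heading pattern and the function name `find_reference_section` are assumptions made for this example, and real papers use many more heading variants.

```python
import re

# Hypothetical heading pattern (an assumption for this sketch): an optional
# section number followed by a typical reference-list heading on its own line.
HEADING = re.compile(
    r"^\s*(?:\d+\.?\s*)?(references|bibliography|literature cited)\s*:?\s*$",
    re.IGNORECASE,
)


def find_reference_section(text):
    """Return the text after the last reference-style heading, or None."""
    lines = text.splitlines()
    # Search from the end, since reference lists normally close a paper.
    for i in range(len(lines) - 1, -1, -1):
        if HEADING.match(lines[i]):
            return "\n".join(lines[i + 1:]).strip()
    return None


sample = "Introduction\nSome text\n7. REFERENCES\nSmith, J. (1999). A paper.\n"
print(find_reference_section(sample))
```

Returning `None` when no heading matches corresponds to the "converted but no reference section found" outcome.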

Test on 1000 documents
• 13% are not found at the URL specified
• 3% are not in PDF or PS
• 15% give errors in the pstotext conversion
• 9% are converted but a reference section cannot be found
• 60% were successfully converted
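
The percentages above sum to 100, so they can be read as shares of the 1000 test documents. The short sketch below turns them into document counts and shows how many documents remain after each step. It assumes the failures occur in the order listed; the function name `survival` is hypothetical.

```python
TOTAL = 1000

# Failure reasons and their share of all test documents, as listed above.
STAGES = [
    ("not found at the specified URL", 13),
    ("not in PDF or PS", 3),
    ("pstotext conversion error", 15),
    ("no reference section found", 9),
]


def survival(total, stages):
    """Return (reason, documents lost, documents remaining) per stage."""
    remaining = total
    rows = []
    for reason, percent in stages:
        lost = total * percent // 100
        remaining -= lost
        rows.append((reason, lost, remaining))
    return rows


for reason, lost, remaining in survival(TOTAL, STAGES):
    print(f"{reason}: {lost} lost, {remaining} remaining")
```

The final count is 600 documents, matching the 60% that were successfully converted.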

Parsing problems of CiteSeer
• Publication date. When a reference contains more than one year, it is discarded
• Source of publication, i.e. the working paper series or journal title, is not parsed by CiteSeer. We will need to add code with a list of all journals and working paper series.
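
Both points can be illustrated with a short Python sketch. It is neither CiteSeer's code nor the planned CitEc code: the year pattern, the function names, and the two-entry `KNOWN_SOURCES` list are assumptions for illustration, and the real list would come from the RePEc data.

```python
import re

# Four-digit years from 1900 to 2099 (an assumption for this sketch).
YEAR = re.compile(r"\b(?:19|20)\d{2}\b")

# Illustrative sample only; not the actual list of journals and series.
KNOWN_SOURCES = ["Journal of Political Economy", "NBER Working Papers"]


def publication_year(reference):
    """Return the year if exactly one is present; otherwise None (discarded)."""
    years = YEAR.findall(reference)
    return int(years[0]) if len(years) == 1 else None


def find_source(reference, sources=KNOWN_SOURCES):
    """Return the longest known source title contained in the reference."""
    lowered = reference.lower()
    matches = [s for s in sources if s.lower() in lowered]
    return max(matches, key=len) if matches else None


ref = "Doe, J. (2000). A title. Journal of Political Economy 108(2)."
print(publication_year(ref), find_source(ref))
```

A reference such as "Smith, J. (1998). Title. Working paper, revised 2001." contains two years, so `publication_year` returns `None`, which mirrors the discarding behaviour described above.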

To do
• Study of citation patterns
• Use of data in user services
• Use of data in logging and registration services

Thank you for your attention. Contact José Manuel Barrueco Cruz for more information
