Creating specialized ontologies using Wikipedia: The Muninn Experience.
A Social Networking Approach to the Legal Learning Track TREC 2011
TREC 2011, Legal Learning Track
Attending Linked Open Data Library and Archives Meeting in San Francisco
Do a billion documents change the First World War?
Abstract:
Presentation at CASBS 2010: Muninn Project
In this talk I will review some of the methods being used in the Muninn project to extract information from the scanned documents of historical archives. Previous data extraction efforts for historical research were done through the human review of documents, one at a time. We employ an approach where computing power is used to collate similar document types to extract the information from them.
The Great War era produced a mix of hand-written and type-written documents that require processing using computer extraction methods assisted by the manual reviews of specific cases by human volunteers. I will contrast this with previous methods that have been used to digitize documents, such as recapchat, and close with some observations about managing archival data in a high-volume setting.
Paper at MEM2010: Canopener: Recycling Old and New Data
Now back at the University of Waterloo
Pages
- « first
- ‹ previous
- 1
- 2
- 3