DID-ACTE: Digging into Data: Automating Chinese Text Extraction
The Automating Data Extraction from Chinese Texts Project aims to provide humanists and social scientists with a means of transforming 2,200 years of Chinese texts into structured data. The project will develop an open-source platform (MARKUS) that allows users to apply sophisticated text-mining techniques to a wide variety of historical and literary texts. Users will be able to tag and extract personal names, dates, place names, official titles and postings, kinship ties, other social relationships, and user-defined content. The platform will be tested against 2,000 local histories spanning an 800-year period, as well as roughly 20,000 letters and 500 notebooks dating from the seventh through the thirteenth centuries. Data extracted from the sample repositories will be used to enrich text-mining applications and will also be made available for research through open-access online databases and data archives.
-didacte: related to teaching; (self-)taught; learned (independently)