Skip to content

Latest commit

 

History

History
29 lines (22 loc) · 1.8 KB

README.md

File metadata and controls

29 lines (22 loc) · 1.8 KB

Restored Qieyun and other data from Fujita (2017; 2023)

Source

  • Thesis
    藤田拓海. 2017. 陸法言『切韻』研究. 東京: 二松学舎大学. (博士学位論文)
  • Monograph
    藤田拓海. 2023. 陸法言『切韻』研究 (開篇 單刊 18). 東京: 好文出版.
    • Official page
    • The content is identical to the main text of the thesis, and its accompanying CD contains a PDF of the entire thesis

Extracted data

Phonological position descriptions 音韻地位描述 in TshetUinh.js v0.15 format are also added.

Process

  1. Download the full text PDF (see link above) as fujita.pdf in the directory
  2. raw.py: Extract all text elements in the appendix Qieyun Table 切韻表 of fujita.pdf to raw.pkl
  3. pages.py: Rebuild the table on each page from raw.pkl to pages.pkl
  4. lines.py: Stringify all lines in pages.pkl to fujita-data.csv and extract restored Qieyun data to 切韻 藤田拓海復元.csv and 切韻 李永富復元.csv
  5. small-rime-diffs.py: Compare the two restored versions and save to small-rime-diffs.csv