History of the Japanese Language
Research resources for studying the history of the Japanese language
- Corpus of Historical Japanese
This corpus collects materials for researching the history of the Japanese language. Development is ongoing, with the aim of producing a diachronic corpus that covers the period from ancient to modern times. The parts completed so far are already available.
Chunagon is a web concordancer that offers three ways of searching the corpora developed by NINJAL: by short-unit word, by long-unit word, and by character string. Combining morphological information makes advanced searches of the corpus possible.
- Corpora of Modern Japanese
These corpora were developed for research on the Japanese language of the Meiji and Taisho eras. The ‘Taiyo corpus’, ‘Modern women’s magazines corpus’, ‘Meiroku Zasshi corpus’, and ‘Kokumin-no-Tomo corpus’ are available.
- Collection of the Research Library for Study of the Japanese Language History
The Research Library owns valuable archives for research into the history of the Japanese language and for development of the historical corpus. Digital images of the archives can be browsed online.
- Transcription of The Tale of Genji Manuscript Book at the Library of Congress
This database provides a transcription of the Tale of Genji manuscript at the Library of Congress (LC, LC Control No.: 2008427768) and supports full-text search.
- Images of The Tale of Genji Manuscript Book at the Library of Congress
This database provides images of the Tale of Genji manuscript at the Library of Congress (LC, LC Control No.: 2008427768), using a viewer that displays images of the original text and the transliterated text side by side. The Kiritsubo, Suma and Kashiwagi chapters are accessible.
- Web ChaMame
A tool for performing morphological analysis with various UniDic dictionaries. It lets researchers carry out the series of tasks needed for morphological analysis online through a user-friendly interface.
- Text Data Sets for Research on the History of Japanese
This website provides data sets (txt, xml, etc.) for research on the history of Japanese.
- Oxford-NINJAL Corpus of Old Japanese
“The Oxford-NINJAL Corpus of Old Japanese” is a lemmatized, parsed and comprehensively annotated digital corpus of all texts in Japanese from the Old Japanese period. In its present version, the ONCOJ contains the full corpus of Old Japanese poetic texts, including the Man'yōshū.
- Wokototen-zu Database
This is a database of Wokototen-zu, charts of the reading marks used for the vernacular reading of Chinese texts. Wokototen can be searched by type, position, shape, and meaning.
- Images of the Amakusa edition of Heike monogatari, Isoho monogatari and Kinkushū in the British Library collection
This database provides images of the Amakusa edition of Heike monogatari, Isoho monogatari, Kinkushū, Kotoba no yawarage and Japanese-Portuguese vocabulary in the British Library collection (BL, Shelfmark: Or.59.aa.1). All images are in the public domain.
- Images and Transcription of Isoho monogatari in the British Library collection
This site provides a viewer that displays images of the Amakusa edition of ‘Esopo no fabulas’ (Isoho monogatari) and the transliterated text side by side.
- Kunten Database
This is a database of gloss information (Kunten information) on the Shangshu (printed in the late 16th century) at the National Institute for Japanese Language and Linguistics.
- NINJAL Hentaigana Database
This site provides a database and a viewer that index glyph images of Hentaigana by reading, base form, and Unicode ID.
Its functions include mutual links with the source images and an option to display the source image and the transliterated text side by side.
- Japanese Word History and Frequency
This tool lets users search at once for statistical information from the corpus, old dictionaries, language maps, language articles, and other materials related to the history of the Japanese language.