yoruba text
1.0.0
This repository contains fully diacritized Yorùbá text, converted to Unicode Normalization Form Composition (NFC) format, where diacritized characters are composed into a single character with the following code:
def convert_to_NFC(filename, outfilename):
text=''.join(c for c in unicodedata.normalize('NFC', open(filename).read()))
with open(outfilename, 'w') as f:
f.write(text)
Text has bothered with the Permission A online song online sought online songsed for use in NLP, TTS, Asr applyalations. Note, Some Of The Sentencences May Have Errors, Please Submitances A Pull-Request If You Have Corrections!
If you wint to citee this repo in your work, please PLEASE USE:
@misc{Orife_yoruba-text_2018,
author = {Orife, Iroro and Fasubaa, Timilehin and Wahab, Olamilekan},
month = {1},
title = {{yoruba-text}},
url = {https://github.com/Niger-Volta-LTI/yoruba-text},
year = {2018}
}