You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
HuangHai a72954a90e
'commit'
1 month ago
..
kie_dict 'commit' 1 month ago
layout_dict 'commit' 1 month ago
unimernet_tokenizer 'commit' 1 month ago
README.md 'commit' 1 month ago
ar_dict.txt 'commit' 1 month ago
arabic_dict.txt 'commit' 1 month ago
be_dict.txt 'commit' 1 month ago
bengali_dict.txt 'commit' 1 month ago
bg_dict.txt 'commit' 1 month ago
bm_dict.txt 'commit' 1 month ago
bm_dict_add.txt 'commit' 1 month ago
bn_dict.txt 'commit' 1 month ago
chinese_cht_dict.txt 'commit' 1 month ago
confuse.pkl 'commit' 1 month ago
cyrillic_dict.txt 'commit' 1 month ago
devanagari_dict.txt 'commit' 1 month ago
en_dict.txt 'commit' 1 month ago
fa_dict.txt 'commit' 1 month ago
french_dict.txt 'commit' 1 month ago
german_dict.txt 'commit' 1 month ago
gujarati_dict.txt 'commit' 1 month ago
hebrew_dict.txt 'commit' 1 month ago
hi_dict.txt 'commit' 1 month ago
it_dict.txt 'commit' 1 month ago
japan_dict.txt 'commit' 1 month ago
ka_dict.txt 'commit' 1 month ago
kazakh_dict.txt 'commit' 1 month ago
korean_dict.txt 'commit' 1 month ago
latex_ocr_tokenizer.json 'commit' 1 month ago
latex_symbol_dict.txt 'commit' 1 month ago
latin_dict.txt 'commit' 1 month ago
mr_dict.txt 'commit' 1 month ago
ne_dict.txt 'commit' 1 month ago
oc_dict.txt 'commit' 1 month ago
parseq_dict.txt 'commit' 1 month ago
ppocrv4_doc_dict.txt 'commit' 1 month ago
ppocrv5_dict.txt 'commit' 1 month ago
pu_dict.txt 'commit' 1 month ago
rs_dict.txt 'commit' 1 month ago
rsc_dict.txt 'commit' 1 month ago
ru_dict.txt 'commit' 1 month ago
samaritan_dict.txt 'commit' 1 month ago
spin_dict.txt 'commit' 1 month ago
syriac_dict.txt 'commit' 1 month ago
ta_dict.txt 'commit' 1 month ago
table_dict.txt 'commit' 1 month ago
table_master_structure_dict.txt 'commit' 1 month ago
table_structure_dict.txt 'commit' 1 month ago
table_structure_dict_ch.txt 'commit' 1 month ago
te_dict.txt 'commit' 1 month ago
th_dict.txt 'commit' 1 month ago
ug_dict.txt 'commit' 1 month ago
uk_dict.txt 'commit' 1 month ago
ur_dict.txt 'commit' 1 month ago
vi_dict.txt 'commit' 1 month ago
xi_dict.txt 'commit' 1 month ago

README.md

Dictionary and Corpus

Dictionary files (usually character level vocabulary) are included here for easier configuration. Corpus contributed by OSS contributors are listed here, please respect copyrights when using them at your own risk.