Compact N-gram Language Models for Armenian

Karamyan, Davit S.; Karamyan, Tigran S.


Title: Compact N-gram Language Models for Armenian

Volume:

ISSN:

2579-2784 ; e-2538-2788

Official URL:

Additional Information:

Քարամյան Դավիթ Ս․, Քարամյան Տիգրան Ս., Карамян Давид С., Карамян Тигран С.

Other title:

Կոմպակտ N-գրամ լեզվի մոդելներ հայերենի համար ; Компактные языковые модели N-грамм для армянского языка

Abstract:

Applications such as speech recognition and machine translation use language models to select the most likely hypothesis among many candidates. For on-device applications, inference time and model size are just as important as performance. In this work, we explored the fastest family of language models, the N-gram models, for the Armenian language. In addition, we studied the impact of pruning and quantization methods on model size reduction. Finally, we used Byte Pair Encoding to build a subword language model. As a result, we obtained a compact (100 MB) subword language model trained on massive Armenian corpora.
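As a rough illustration of how an N-gram model can rank competing hypotheses, here is a minimal sketch of a bigram model in Python. It is not the authors' implementation: the function names (`train_bigram`, `log_prob`, `best_hypothesis`), the toy English corpus, and the use of add-one smoothing are all assumptions made for this example, and the pruning, quantization, and subword steps mentioned in the abstract are not shown.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count contexts and bigrams over whitespace-tokenized sentences.

    Hypothetical helper for illustration only.
    """
    contexts, bigrams = Counter(), Counter()
    vocab = {"</s>"}
    for sentence in sentences:
        tokens = ["<s>"] + sentence.split() + ["</s>"]
        vocab.update(tokens[1:])
        contexts.update(tokens[:-1])             # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent token pairs
    return contexts, bigrams, len(vocab)


def log_prob(sentence, contexts, bigrams, vocab_size):
    """Log-probability of a sentence under add-one (Laplace) smoothing.

    Add-one smoothing is an assumption chosen for brevity.
    """
    tokens = ["<s>"] + sentence.split() + ["</s>"]
    total = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        numerator = bigrams[(prev, cur)] + 1
        denominator = contexts[prev] + vocab_size
        total += math.log(numerator / denominator)
    return total


def best_hypothesis(hypotheses, model):
    """Return the hypothesis with the highest model log-probability."""
    return max(hypotheses, key=lambda h: log_prob(h, *model))


if __name__ == "__main__":
    model = train_bigram(["the cat sat", "the cat ran", "a dog sat"])
    print(best_hypothesis(["cat the sat", "the cat sat"], model))
```

In a recognizer or translation system, a score of this kind would typically be combined with the acoustic or translation model score rather than used alone.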

Publisher:

Publishing House of the NAS RA (Изд-во НАН РА)

Format:

pdf

Extent:

pp. 30-38

Identifier:

oai:arar.sci.am:323480

Language:

Location of original object:

Fundamental Scientific Library of the NAS RA

Subject and keywords:

Mathematical cybernetics ; Computer science ; Armenian language ; N-gram Language Model ; Subword Language Model ; Pruning ; Quantization

Object collections:

Digital Library > Articles

Last modified:

Aug 18, 2025

In our library since:

Jul 14, 2022

All available object's versions:

https://arar.sci.am/publication/351118

NAS RA Serial Publications

Edition name	Date
Karamyan, Davit S., Compact N-gram Language Models for Armenian	Aug 18, 2025
