Low-resource text classification using cross-lingual models for bullying detection in the Ukrainian language

V. Oliinyk; I. Matviichuk

doi:10.20535/1560-8956.42.2023.279093

Adaptivni Sistemi Avtomatičnogo Upravlinnâ (May 2023)

Affiliations

V. Oliinyk: Igor Sikorsky Kyiv Polytechnic Institute
I. Matviichuk: Igor Sikorsky Kyiv Polytechnic Institute

Journal volume & issue: Vol. 1, no. 42, pp. 87-100

Abstract

The object of research is multilingual models for training on limited datasets. The article reviews such models and analyzes their development. Multilingual models have been applied to many low-resource languages, but Ukrainian is not among them. The purpose of the work is to increase the effectiveness of text classification for Ukrainian when only a limited dataset is available, by using multilingual models, a zero-shot learning approach (i.e., training without target-language data), and machine translation to create or augment a dataset. Refs.: 24, figs.: 5, tables: 3.
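
The dataset augmentation step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function `augment_with_translation`, the label convention (1 = bullying, 0 = neutral), and the lookup-table translator `toy_translate` are all assumptions made for illustration. A real setup would replace the lookup table with an actual machine translation system.

```python
from typing import Callable, List, Tuple

# (text, label); the label convention 1 = bullying, 0 = neutral is an assumption.
Example = Tuple[str, int]


def augment_with_translation(
    source_data: List[Example],
    target_data: List[Example],
    translate: Callable[[str], str],
) -> List[Example]:
    """Translate labeled source-language examples and append them to the target set.

    Labels are carried over unchanged. Translations that duplicate a text
    already present in the target set are skipped.
    """
    seen = {text for text, _ in target_data}
    augmented = list(target_data)
    for text, label in source_data:
        translated = translate(text)
        if translated not in seen:
            seen.add(translated)
            augmented.append((translated, label))
    return augmented


# Hypothetical stand-in for a machine translation system: a tiny lookup table.
_TOY_EN_UK = {
    "you are stupid": "ти дурний",
    "have a nice day": "гарного дня",
}


def toy_translate(text: str) -> str:
    # Unknown inputs are returned unchanged in this toy version.
    return _TOY_EN_UK.get(text, text)


english = [("you are stupid", 1), ("have a nice day", 0)]
ukrainian = [("гарного дня", 0)]
train_set = augment_with_translation(english, ukrainian, toy_translate)
```

Here `train_set` contains the original Ukrainian example plus the one translated English example that was not already present. In the zero-shot setting described in the abstract, no such target-language examples would be used for training at all; the translated set is what distinguishes the augmentation approach.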

Published in Adaptivni Sistemi Avtomatičnogo Upravlinnâ

ISSN: 1560-8956 (Print); 2522-9575 (Online)
Publisher: Igor Sikorsky Kyiv Polytechnic Institute
Country of publisher: Ukraine
LCC subjects: Technology: Technology (General): Industrial engineering. Management engineering: Automation
Website: http://asac.kpi.ua/
