Update README.md
README.md
CHANGED
@@ -9,12 +9,22 @@ metrics:
 - accuracy
 ---
 
+<p><b> Til identifikatori.</b>
+
+Tabiiy tilni qayta ishlash (NLP) sohasida tilni aniqlash vazifasi maʼlum matn yoki hujjat tilini aniqlashni oʻz ichiga oladi,
+ammo koʻplab tillarni aniqlash qobiliyati qiyinlashadi. Ushbu model matndan 21 tilni tanib oladi, xususan, oʻzbek tilida
+qoʻllaniladigan lotin-kirill yozuviga eʼtibor qaratadi. Bu boradagi tadqiqotlar kamligini hisobga olib, mos transformator
+arxitekturasiga asoslangan oʻzbek lotin-kirill yozuvini aniqlik darajasi yuqori boʻlgan tilni aniqlash modelini taqdim etamiz.
+Modelimiz biz yaratgan oʻzbek tili korpusidan foydalangan holda baholandi, bu ham kelajakda oʻzbek tilini aniqlash vazifalarini
+baholash uchun qimmatli manba boʻlib xizmat qilishi mumkin. Ushbu model 21 ta tilni, jumladan, ikkita alifboda (lotin va kirill)
+ifodalangan oʻzbek tilini qamrab oladi.
+
 <p><b> language identifier. </b>
-
-
-
-
-
-
-
+
+Language identification is the Natural Language Processing (NLP) task of determining the language of a given text or document,
+and it becomes harder as the number of candidate languages grows. This model recognizes 21 languages from text, with a particular
+focus on Uzbek, which is written in both Latin and Cyrillic scripts. Because little research exists in this area, we present a
+language identification model, based on a suitable transformer architecture, that identifies Uzbek in either script with high accuracy.
+The model was evaluated on an Uzbek corpus that we created, which may also serve as a resource for evaluating Uzbek language
+identification in the future. The 21 covered languages include Uzbek written in both of its scripts (Latin and Cyrillic).
 
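To make the two-script point in the added description concrete, here is a minimal sketch of telling Latin-script from Cyrillic-script text by counting letters. This is not the model's method, and the function name `dominant_script` is a hypothetical helper introduced only for illustration. It assumes nothing beyond the Python standard library.

```python
import unicodedata
from collections import Counter


def dominant_script(text: str) -> str:
    """Return 'latin', 'cyrillic', or 'unknown' by counting letters per script.

    Hypothetical illustration only: this detects the script, not the language.
    """
    counts = Counter()
    for ch in text:
        if not ch.isalpha():
            continue  # skip digits, whitespace, and punctuation
        name = unicodedata.name(ch, "")
        if name.startswith("LATIN"):
            counts["latin"] += 1
        elif name.startswith("CYRILLIC"):
            counts["cyrillic"] += 1
    if not counts:
        return "unknown"
    return counts.most_common(1)[0][0]


print(dominant_script("Oʻzbek tili"))
print(dominant_script("Ўзбек тили"))
```

A check like this only separates scripts. It cannot distinguish Uzbek in Cyrillic from other languages written in Cyrillic, which is the gap a trained classifier such as the one described here is meant to fill.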