#{SCRIPTS_HERE}

Language pack for தமிழ் (Tamil)

Status: published. This language pack has already been finalized and published.

Character map

The following characters, grouped by Unicode script, are considered for this language pack.

Tamil

ி

Common






!
"
#
$
%
&
'
(
)
*
+
,
-
.
/
0
1
2
3
4
5
6
7
8
9
:
;
<
=
>
?
@
^
_
`
{
}
~
¡
¢
£
¤
¥
¦
§
¨
©
«
¬
­
°
·
»
¼
÷
˜


Latin

A
B
C
D
E
F
G
H
I
J
K
L
M
N
O
P
Q
R
S
T
U
V
W
X
Y
Z
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
y
z
ª
Ò
Ô
Õ
Ù
à
è
é
ë
í
ï
ó
ô
õ
ö
ø
ù
ú
û
ü
ý
þ
Œ
œ
Š
ž
ƒ

Inherited

Devanagari

ि

Unknown

Greek

μ

Language pack rules

Starting with all characters that appear in the example sentences, declare the following characters as:

Used Huffman trees

Example sentences

Miscellaneous

Example sentences have been downloaded from http://corpora.uni-leipzig.de/downloads/tam_newscrawl_2011_100K-text.tar.gz

Symbol meanings:

A
    The character has appeared less than 10 times in the example sentences.
A
    The character has appeared at least 10 times, but was seen very rarely.
A
    The character has been seen very often.
A
    The character is included as an important character.
A
    The character is included as an important character, but there's already a lowercase variant of it.
A
    The character is included as a supplementary character.
A
    The character is excluded from the language pack.

Hint: Hover over a character to see its unicode name.