tamil package¶
Submodules¶
tamil.date module¶
-
class
tamil.date.
DateUtils
[source]¶ -
DAY
= u'\u0ba8\u0bbe\u0bb3\u0bcd'¶
-
DAY_SUFFIX
= u'\u0b95\u0bbf\u0bb4\u0bae\u0bc8'¶
-
HOUR
= u'\u0bae\u0ba3\u0bbf'¶
-
MINUTE
= u'\u0ba8\u0bbf\u0bae\u0bbf\u0b9f\u0bae\u0bcd'¶
-
MONTH
= u'\u0bae\u0bbe\u0ba4\u0bae\u0bcd'¶
-
MONTHS
= {u'April': u'\u0b8f\u0baa\u0bcd\u0bb0\u0bb2\u0bcd', u'August': u'\u0b86\u0b95\u0bb8\u0bcd\u0b9f\u0bcd', u'December': u'\u0b9f\u0bbf\u0b9a\u0bae\u0bcd\u0baa\u0bb0\u0bcd', u'February': u'\u0baa\u0bbf\u0baa\u0bcd\u0bb0\u0bb5\u0bb0\u0bbf', u'January': u'\u0b9c\u0ba9\u0bb5\u0bb0\u0bbf', u'July': u'\u0b9c\u0bc2\u0bb2\u0bc8', u'June': u'\u0b9c\u0bc2\u0ba9\u0bcd', u'March': u'\u0bae\u0bbe\u0bb0\u0bcd\u0b9a\u0bcd', u'May': u'\u0bae\u0bc7', u'November': u'\u0ba8\u0bb5\u0bae\u0bcd\u0baa\u0bb0\u0bcd', u'October': u'\u0b85\u0b95\u0bcd\u0b9f\u0bc7\u0bbe\u0baa\u0bb0\u0bcd', u'September': u'\u0b9a\u0bc6\u0baa\u0bcd\u0b9f\u0bae\u0bcd\u0baa\u0bb0\u0bcd'}¶
-
MONTHS_INDEX
= [None, u'January', u'February', u'March', u'April', u'May', u'June', u'July', u'August', u'September', u'October', u'November', u'December']¶
-
TIME
= u'\u0ba8\u0bc7\u0bb0\u0bae\u0bcd'¶
-
WEEK
= u'\u0bb5\u0bbe\u0bb0\u0bae\u0bcd'¶
-
WEEKDAYS
= {u'friday': u'\u0bb5\u0bc6\u0bb3\u0bcd\u0bb3\u0bbf', u'monday': u'\u0ba4\u0bbf\u0b99\u0bcd\u0b95\u0bb3\u0bcd', u'saturday': u'\u0b9a\u0ba9\u0bbf\u0b95\u0bcd\u0b95\u0bbf\u0bb4\u0bae\u0bc8', u'sunday': u'\u0b9e\u0bbe\u0baf\u0bbf\u0bb1\u0bc1', u'thursday': u'\u0bb5\u0bbf\u0baf\u0bbe\u0bb4\u0ba9\u0bcd', u'tuesday': u'\u0b9a\u0bc6\u0bb5\u0bcd\u0bb5\u0bbe\u0baf\u0bcd', u'wednesday': u'\u0baa\u0bc1\u0ba4\u0ba9\u0bcd'}¶
-
WEEKDAYS_INDEX
= [u'monday', u'tuesday', u'wednesday', u'thursday', u'friday', u'saturday', u'sunday']¶
-
YEAR
= u'\u0b86\u0ba3\u0bcd\u0b9f\u0bc1'¶
-
tamil.iscii module¶
tamil.numeral module¶
tamil.regexp module¶
tamil.tscii module¶
tamil.tweetparser module¶
tamil.utf8 module¶
-
tamil.utf8.
all_tamil
(word_in)[source]¶ predicate checks if all letters of the input word are Tamil letters
-
tamil.utf8.
compare_words_lexicographic
(word_a, word_b)[source]¶ compare words in Tamil lexicographic order
-
tamil.utf8.
get_letters
(word)[source]¶ splits the word into a character-list of tamil/english characters present in the stream
-
tamil.utf8.
get_letters_iterable
(word)[source]¶ splits the word into a character-list of tamil/english characters present in the stream
-
tamil.utf8.
get_tamil_words
(letters)[source]¶ reverse a Tamil word according to letters, not unicode-points
-
tamil.utf8.
get_words_iterable
(letters, tamil_only=False)[source]¶ given a list of UTF-8 letters section them into words, grouping them at spaces
-
tamil.utf8.
is_tamil_unicode_predicate
(x)¶
-
tamil.utf8.
istamil
(tchar)[source]¶ check if the letter tchar is prefix of any of tamil-letter. It suggests we have a tamil identifier
-
tamil.utf8.
istamil_alnum
(tchar)[source]¶ check if the character is alphanumeric, or tamil. This saves time from running through istamil() check.
-
tamil.utf8.
istamil_prefix
(word)[source]¶ check if the given word has a tamil prefix. Returns either a True/False flag
-
tamil.utf8.
joinMeiUyir
(mei_char, uyir_char)[source]¶ This function join mei character and uyir character, and retuns as compound uyirmei unicode character.
- Inputs:
- mei_char : It must be unicode tamil mei char. uyir_char : It must be unicode tamil uyir char.
Written By : Arulalan.T Date : 22.09.2014
-
tamil.utf8.
letters_to_py
(_letters)[source]¶ return list of letters e.g. uyir_letters as a Python list
-
tamil.utf8.
splitMeiUyir
(uyirmei_char)[source]¶ This function split uyirmei compound character into mei + uyir characters and returns in tuple.
Input : It must be unicode tamil char.
Written By : Arulalan.T Date : 22.09.2014
-
tamil.utf8.
tamil
(idx)[source]¶ retrieve Tamil letter at canonical index from array utf8.tamil_letters
-
tamil.utf8.
to_unicode_repr
(_letter)[source]¶ helpful in situations where browser/app may recognize Unicode encoding in the u0b8e type syntax but not actual unicode glyph/code-point