XCLUB-COOL STUFF AROUND YOU

IIT Madras team develops easy OCR system for nine Indian languages

#1
Posted on 2019-04-28 23:50:03 from mobile
Taking a cue from European languages, several of which share the same Roman letter-based script, Srinivasa Chakravarthy's team at IIT Madras has, over the last decade, developed a unified script, named the Bharati script, for nine Indian languages.

The team has now gone a step further: it has developed a method for reading documents in Bharati script using a multilingual optical character recognition (OCR) scheme.

The team has also created a finger-spelling method that can be used to generate a sign language for hearing-impaired persons. In collaboration with TCS Mumbai, the researchers have found a way for persons with a hearing disability to generate signatures using this finger-spelling technique.

The scripts that have been integrated are Devanagari, Bengali, Gurmukhi, Gujarati, Oriya, Telugu, Kannada, Malayalam and Tamil. English and Urdu have not been integrated so far. Dr Chakravarthy says, “Urdu and English alphabet systems have a very different phonetic organisation. But that does not mean a mapping is not possible. It is quite possible and can be done.”

In general, optical character recognition schemes first separate (or segment) the document into text and non-text. The text is then segmented into paragraphs, sentences, words and letters. Each letter has to be recognised as a character in a standard encoding such as ASCII or Unicode. A letter has various components, such as the basic consonant, consonant modifiers and vowels.

Easy to read

The scripts of Indian languages pose a problem for such character recognition because the vowel and consonant-modifier components are attached to the main consonant part. This difficulty is removed in the Bharati script, which can be read easily. “In Bharati characters, these different components are segmentable by design. So OCR works quite accurately. Our OCR engines gives almost 100% accuracy even with mild noise added,” says Dr Chakravarthy.

Three-tiered structure

The ease in design comes about because Bharati characters are made up of three tiers stacked vertically. The consonant at the root of the letter is placed in the centre tier, and the modifiers are placed in the top and bottom tiers.

In collaboration with Sunil Kopparapu of Innovation Labs, TCS, Mumbai, the team has developed a universal finger-spelling language for the nine Indian languages. They are working on a system that can help people sign documents using a finger-spelling method, and future plans include developing a new Braille system with the Bharati script.
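
To make the segmentation idea above a little more concrete, here is a minimal Python sketch of projection-profile segmentation: a text line is cut into letters wherever a column has no ink, and each letter is then cut into vertical tiers wherever a row has no ink. This is not the IIT Madras team's method, which the article does not describe. The toy bitmap, the `#`/`.` encoding, the assumption that tiers are separated by blank rows, and all function names (`ink_runs`, `split_letters`, `split_tiers`) are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

# Assumed toy encoding: each row is a string, '#' is ink, '.' is background.
Bitmap = List[str]


def ink_runs(profile: List[int]) -> List[Tuple[int, int]]:
    """Return half-open (start, end) ranges where the profile is non-zero."""
    runs = []
    start = None
    for i, count in enumerate(profile):
        if count and start is None:
            start = i                      # a run of ink begins
        elif not count and start is not None:
            runs.append((start, i))        # a blank gap ends the run
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def row_profile(bitmap: Bitmap) -> List[int]:
    """Count ink pixels in every row."""
    return [row.count("#") for row in bitmap]


def column_profile(bitmap: Bitmap) -> List[int]:
    """Count ink pixels in every column."""
    width = max((len(row) for row in bitmap), default=0)
    return [
        sum(1 for row in bitmap if x < len(row) and row[x] == "#")
        for x in range(width)
    ]


def split_letters(line: Bitmap) -> List[Bitmap]:
    """Cut a text line into letters at blank columns."""
    return [[row[a:b] for row in line] for a, b in ink_runs(column_profile(line))]


def split_tiers(letter: Bitmap) -> List[Bitmap]:
    """Cut one letter into vertical bands at blank rows (assumed tier gaps)."""
    return [letter[a:b] for a, b in ink_runs(row_profile(letter))]


# Hypothetical line holding two made-up glyphs (not real Bharati shapes).
LINE = [
    ".##...#.",
    "........",
    "###..###",
    "#.#..#.#",
    "........",
    "..#.....",
]

letters = split_letters(LINE)
tiers_per_letter = [split_tiers(letter) for letter in letters]
```

In this toy input the first glyph splits into three bands and the second into two. The sketch only finds the bands; deciding whether a band is the top, centre or bottom tier would need its vertical position within the line, which is left out here.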
#2
Posted on 2019-04-28 23:53:06 from mobile
Good Informative Share!
#3
Posted on 2019-04-29 00:19:28 from mobile
Nice share