セッション概要

In today's Information Age large volumes of information are available in English – whether it be information for competitive exams or general reading. However, many people whose primary language is Hindi or any other Indian language (or, Japanese, too) are unable to access information in English. Anusaaraka (we) aims to bridge this language barrier by allowing a user to enter an English text into Anusaaraka and read it in an Indian language (or, Japanese) of their choice.

What the audience can bring back: Machine translation convert one language to another language, learn more about it! The audience will learn more about why Anusaaraka aims to bridge a language barrier by allowing a user to enter an English text into Anusaaraka and read it in an Indian language of their choice (Japanese). When any machine does that work they need big parallel corpus that can help for making some rules and disambiguate many senses, let's look into that!

I've worked on (in connection with this topic):
– Implemented Word Analyzer, Local Word Grouper as a part of Standard components of MT system along with mapping
using Bilingual Dictionaries.
– Successfully implemented the Word analyzer algorithm. It was designed using Sandhi, Wrong breaking of words, and
grammar for Written Language. Spelling variation also occurs because Telugu does not make.
– Worked on generating the converters algorithm
– I'm an Undergraduate Researcher at Anusaaraka lab - Language Translation, and NLP for Indian languages, LTRC, IIIT-Hyderabad under the supervision of Professor Vineet Chaitanya.

I'm from India, and I'd like to take part individually. I'm a part of PyCon JP staff team.

Anusaaraka Lab: it's one of the labs of the Language Technologies Research Centre (LTRC). LTRC is the largest academic centre of speech and language technology in South Asia, and LTRC carries out its work through four labs, which work in synergy with each other.

• https://code.google.com/archive/p/anusaaraka/
• https://sourceforge.net/projects/anusaaraka/
• http://cdn.iiit.ac.in/cdn/ltrc.iiit.ac.in/downloads/nlpbook/nlp-panini.pdf

講師

Jasveen S

(Anusaaraka Lab, LTRC (Language Technologies Research Centre), IIIT Hyderabad (International Institute of Information Technology Hyderabad))

セッション情報

担当:: Anusaaraka Lab, LTRC (Language Technologies Research Centre)
対象者:: Intermediate (finished Python tutorial and worked on third-party libraries for web app development, data analysis, automation, etc.)
前提知識:: Having an Operating system of POSIX compatible is a plus. Nothing too specific is a prerequisite. Interest in Python (MT- Machine translation) is obviously important. 50% (Audience has half the expertise of the presenter)
カテゴリー:: translation software/machine translation tool

資料

PDFをダウンロード