Research & Development (Neural Machine Translation) Intern

Applications are closed for this internship. Click here to browse more internships.
Research & Development (Neural Machine Translation)

IIT Bombay

Start Date
15 Jun - 30 Jun' 20
Duration
6 Months
Stipend
₹ 3,000-8,000 /month
APPLY BY
31 May' 20
Posted 3 weeks ago
Internship

About the internship

Selected intern's day-to-day responsibilities include:

1. Work on data cleaning, pre-processing, and text parsing
2. Write well designed, testable, efficient code by using best software development practices
3. Work on implementation of ML model for unsupervised tasks
4. Engage in server handling and API deployment
5. Stay plugged into emerging technologies/industry trends and applying them to operations and activities
6. Develop the next generation of core MT technology to allow our users to communicate across language barriers

Skill(s) required

Deep Learning Machine Learning Natural Language Processing (NLP) Python Software Testing

Who can apply

Only those candidates can apply who:

1. are available for full time (in-office) internship

2. can start the internship between 15th Jun'20 and 30th Jun'20

3. are available for duration of 6 months

4. have relevant skills and interests

Added requirements

1. Expertise in key language technologies including machine translation or natural language processing

2. Proven background in machine learning and deep learning including deep neural networks, sequence-to-sequence models, etc.

3. In-depth knowledge of architectures like Transformers, Encoder-Decoder, LSTMs, RNNs, etc.

4. Hands-on experience with deep learning toolkits including Tensorflow, PyTorch, Keras, etc.

5. Ability to formulate a research problem, design, experiment and implement solutions in Python

6. Excellent spoken and written communication skills

7. Strong dedication and consistency towards long research projects

8. Good to have experience working with standard MT/NLP toolkits, e.g. Sockeye, OpenNMT, etc.

Perks

Certificate Letter of recommendation Informal dress code
Additional information

This is an in-office internship & will start in June.

The need for translating domain specific content such as legal documents, technical and non-technical documents, educational materials, government procedures and services is increasing exponentially. Most of the tools manufactured by Russia are written in Russian. These need to be translated efficiently into English to be of benefit to the Indian Navy.

This includes the automated translations for Standards (GOST), Operating Documents, Repair Technical Documents (RTD), Technical Drawings, Contracts, Supplementary Agreements (SAs), Price Catalogues, Speeches, Minutes of Meetings, etc. These documents are available in formats like Word, Excel, PDF, Power Point, Image etc. The translation process presently being undertaken by RTC is manual.

Manual Translation of documents is evidently tedious and time intensive. Hence the goal is to build processes and models that would lead to enhanced translation tools enabling large-scale translation of domain specific content into English. We propose an online framework for translating Russian to English.

Number of openings

1

About IIT Bombay

The Indian Institute of Technology, Bombay (IITB) is one of the fifteen higher institutes of technology in the country, set up intending to make facilities available for higher education, research, and training in various fields of science and technology. Professor Ganesh Ramakrishnan (department of CSE) and professor Ramasubramanian (department of humanities and social sciences) are attempting to significantly speed up the process of digitization of Sanskrit texts. Enabled by the OCR and post-editing related technologies developed at IIT Bombay, they are now seeking the participation of the community of Sanskrit lovers, software developers, machine learning enthusiasts, project managers, etc.
Activity on Internshala
Hiring since December 2013
415 opportunities posted
107 candidates hired
Sign up to continue

OR

By signing up, you agree to our Terms and Conditions.