1. Work on following the said SDLC
2. Work on data mining and text analytics
3. Engage in mandatory visits of various centers to gather information and understand ongoing methodology/ process
5. Articulately communicate and interact by asking pertinent questions in a detail oriented manner (during field visits)
6. Document the process to facilitate effective requirement gathering
7. List different scope of ML/AI solutions that could alleviate the said problems
8. Engage in the analytical/feasibility study and prepare a proof of concept (POC)
9. Design a platform independent system prototype/model
10. Write well designed, testable, efficient code by using best software development practices
11. Stay plugged into emerging technologies/industry trends and applying them to operations, and activities
1. are available for full time (in-office) internship
2. can start the internship between 7th May'19 and 6th Jun'19
3. are available for duration of 6 months
4. have relevant skills and interests
Added requirements
1. Applicants must have a strong computer science background with acumen in data analytics, problem-solving, and research
2. Any relevant course on data mining/data science would be an added advantage
3. Male candidates and 2019 pass-outs preferred
4. Willingness to travel across India for field visits (3-4) in initial days to understand the data process
Perks
Certificate Letter of recommendation Flexible work hours Informal dress code 5 days a week
Additional information
This project aims to contribute to India’s health data ecosystem by strengthening data quality by: 1) initiating and nurturing a National Data Quality Forum (NDQF) and through 2) quality analytics for accurate, and evidence-based decision-making. It would enable recommending strategies to improve data quality in large scale surveys and monitoring mechanisms on indicators linked to sustainable development goals (SDGs), and enhancing capacities to implement data quality assurance through a series of activities and resources (e.g. workshops, web-based forums for sharing best practices, manuals for implementing data quality assurance). The project will also use data analytics to inform dialogue and action for data quality and guide programming and policy decisions. As a result, this project will enhance two key components of India’s data value chain: data collection and data analytics for decision-making.
The Indian Institute of Technology, Bombay (IITB) is one of the fifteen higher institutes of technology in the country, set up intending to make facilities available for higher education, research, and training in various fields of science and technology. Professor Ganesh Ramakrishnan (department of CSE) and professor Ramasubramanian (department of humanities and social sciences) are attempting to significantly speed up the process of digitization of Sanskrit texts. Enabled by the OCR and post-editing related technologies developed at IIT Bombay, they are now seeking the participation of the community of Sanskrit lovers, software developers, machine learning enthusiasts, project managers, etc.