About MetroGhar Property Solutions Private Limited
MetroGhar is a Bangalore-based start-up specializing in the real estate market. Our vision is to make a standardized and easy medium for builders/developers to sell their inventory to potential clients.
1. Work on crawling, extracting, and processing data (e.g. Scrapy, pandas, MapReduce, SQL, BeautifulSoup, etc.)
2. Gather and process raw data at scale (including writing scripts, web scraping, calling/creating APIs, etc.) from the web/internet
3. Develop the capability to efficiently scrape data from the web from multiple sources
4. Scrape difficult websites by deploying anti-blocking and anti-captcha tools
5. Develop various RestFul APIs & Integrate them with various data sources
6. Develop tools & techniques related to data extraction from web or PDF files and other process automation
7. Develop frameworks for automating and maintaining a constant flow of data from multiple sources
8. Optimize the scraping capability to ensure the data is scrapped efficiently with the minimum usage of server bandwidth
Skills and other requirements:
1. Good experience in web scraping frameworks Scrapy, Selenium, SQLAlchemy, Pandas, Beautiful Soup, or other frameworks and related libraries
2. Experience in creating scrapy spiders for websites with Captcha, IP ban, Geolocation ban, Cloudflare/Imperva firewalls, sites requiring a login to access data, dynamic websites loading through JS/REST API/Graphql, etc.
3. Good knowledge of various Python Libraries, Web Scraping, Automations, APIs, and toolkits
4. Basic understanding of front-end technologies, such as JavaScript, HTML5, and CSS3