Century Health is at the forefront of transforming patient care through cutting-edge technology. Our mission is to accelerate patient access to breakthrough treatments by harnessing the power of AI to analyze real-world clinical data. By joining us, you become part of a dynamic team dedicated to developing this real-world data marketplace.
We are seeking a highly skilled and motivated senior data engineer to join our growing team! This role is crucial for developing and optimizing our data pipelines, and ensuring data quality and accessibility for advanced analytics and AI models. The ideal candidate will have a strong background in data engineering, with proven experience in data pipelining and orchestration, big data technologies such as Spark and Python, and familiarity with cloud infrastructure and database systems.
As one of the first hires to our fast-growing startup, you will be given a lot of responsibility and the opportunity to shape our product and data architecture from the ground up! Furthermore, you will receive direct mentorship from the CTO, work closely with our full-stack engineering team, and participate fully in all team events.
Key responsibilities:
1. Design, build, and maintain efficient, reliable, and scalable data pipelines, from raw data ingest to data cleaning to managing front-end outputs
2. Implement data orchestration workflows using Airflow or Kedro to manage ETL processes
3. Develop and optimize data processing tasks using Python and PySpark
4. Leverage AWS cloud services to enhance our data infrastructure's scalability and performance
5. Work on database management systems, such as MongoDB, Postgres, and RedCap
6. Collaborate with full-stack engineers to manage hand-off between data insights and front-end visualization
7. Create LLM integration for a Text2SQL and Text2Python front end
8. Ensure high-quality data governance and security practices are maintained
2. Candidate must be available to work from 6:00 pm - 9:30 pm Indian Standard Time (as the company is based outside of India & their local work timings are 8:30 am - 12:00 pm Eastern Standard Time)