Data/Analytics Lead
Noida, Uttar Pradesh, India · Tempo pieno
Sii il primo a candidarti
- Esperienza
- 6 yrs
- Stipendio
- INR 2,500,000 – INR 3,200,000 / year
- Aperture
- 1
- Pubblicato
- 1 ora fa
- Work mode
- In ufficio
- Istruzione
- Computer Science Engineering
- Eligibility
- Candidates with a minimum of 6 years of experience and a Computer Science Engineering background can apply.
- Resume
- Required to apply
Where you'll work
Descrizione del lavoro
About the Role
This position is with a fast-moving technology company that is creating secure, sovereign, and scalable digital products across AI, Web3.0, cloud, and health-tech. The selected professional will lead the creation of a modern analytics ecosystem built entirely on open-source technology, including data pipelines, a lakehouse setup, and BI systems developed from the ground up. The role also involves managing a team of 4 to 6 data engineers.
Leadership and Strategy
- Guide and mentor a 4–6 member data engineering team.
- Own the architecture and long-term direction of the analytics platform.
- Set up the data platform from scratch using open-source components.
- Create and maintain governance, reliability, and data quality standards.
Platform Architecture
- Design a complete analytics stack on open-source technologies.
- Develop a data lakehouse using Cloudian object storage.
- Plan and tune ClickHouse for OLAP use cases with sub-second query performance.
- Build ingestion flows from APIs, databases, and SaaS tools.
- Support both batch and real-time data movement patterns.
Data Ingestion and ETL
- Develop pipelines with Airbyte, including CDC, connector setup, and validation.
- Coordinate workflows through Apache Airflow with scheduling, monitoring, and failure handling.
- Design load strategies for both incremental and full-refresh processing.
- Implement streaming workflows using Kafka.
Data Warehouse and Analytics
- Improve ClickHouse performance using materialized views and distributed tables.
- Design dimensional data models such as star and snowflake schemas.
- Build semantic layers to keep business metrics consistent across tools.
BI Platform
- Launch Apache Superset as a self-service analytics layer.
- Support Power BI integration where needed.
- Use Redis-based caching to improve dashboard speed and responsiveness.
Data Quality, Governance, and Compliance
- Set up data quality checks using tools like Great Expectations and Soda.
- Automate reconciliation and validation across datasets.
- Put data lineage, cataloging, and compliance processes in place, including GDPR-related requirements.
Automation and Optimization
- Develop Python-based automation scripts and custom connectors.
- Introduce CI/CD practices for data pipelines.
- Continuously improve cost efficiency and platform performance.
Additional Information
This is a full-time, onsite role based in Noida, Uttar Pradesh, India. The salary range is INR 25,00,000 to 32,00,000 per year. There is 1 opening. The role is expected to begin immediately. Applications are open only to candidates with at least 6 years of experience and a background in Computer Science Engineering. The deadline to apply is 30 September 2026, 23:59:59. Probation-related salary details were not specified. A job offer is listed as a perk.