About this role
A Data Engineer at Ruby Labs is responsible for building and maintaining a reliable data platform by developing ETL/ELT data pipelines, implementing monitoring and alerting systems, and establishing data quality checks. The role involves collaborating closely with the lead data engineer to enhance the data stack's observability, reliability, and scalability, utilizing technologies such as Python, SQL, Airflow, DBT, BigQuery, and ClickHouse. The Data Engineer will also ensure comprehensive documentation and efficient data structures to support data-driven decision-making across the organization.
Required Skills
- Data Pipeline Development
- Monitoring Systems
- Data Quality Checks
- Data Structures Maintenance
- Technical Documentation
- Stakeholder Collaboration
- Problem-Solving Skills
- Attention to Detail
- Python Proficiency
- SQL Skills
+6 more
Qualifications
- Proficiency in Python for data pipeline development
- Strong SQL skills and experience working with cloud data warehouses (ClickHouse, BigQuery preferred)
- Hands-on experience with DBT (modelling, testing, documentation, and deployment)
- Experience with workflow orchestration tools such as Airflow
- Familiarity with data quality frameworks (e.g. Great Expectations, DBT tests) and anomaly detection methods
- Experience building monitoring and alerting systems for data pipelines and data quality
- Ability to write clear, maintainable, and actionable technical documentation
About Ruby Labs
rubylabs.comRuby Labs is a forward-thinking company that specializes in developing innovative consumer products aimed at transforming sectors such as health, education, and entertainment. With a commitment to enhancing lives, Ruby Labs leverages cutting-edge technology to create solutions that are not only functional but also impactful. The company focuses on crafting products that push the boundaries of traditional markets, creating value for consumers and making a positive difference in their everyday lives.