Company
Ancestry
Title
Cloud Engineer - DNA Science
Apply here
Location
Remote – USA
Job Description
What You’ll Do:
- Streamline manual processes to improve quality, reproducibility, and access to data.
- Build and maintain data systems and pipelines.
- Harmonize and transform raw information from different sources to create consistent and machine-readable formats.
- Develop and test architectures that enable data extraction and transformation for predictive or prescriptive modeling
Who You Are:
- Previous experience as a data engineer or in a similar role
- Technical expertise with data models, data mining, and segmentation techniques
- Knowledge of Python programming languages
- Hands-on experience with complex SQL queries and database design
- Experience developing interactive visualization tools to support non-technical stakeholders (e.g., RShiny, Streamlit, Tableau, web-based applications)
- Experienced with Spark/Hadoop experience for big data analytics
Required Skills and Qualifications:
- Master’s degree in stats, physics, computer science, or related discipline
- Minimum 2 year of experience with Python, SQL, and interactive data visualization/exploration tools
- Experience using Python to automate tasks
- Experience with Bash, Linux administration
- Communication skills, especially explaining technical concepts to non-technical business leaders
- Comfort working in a dynamic, research-oriented team with concurrent projects
- Familiarity with the AWS ecosystem
- Knowledge of infrastructure tools like Terraform and AWS Cloud Formation
- Experience working with biological data/scientists
- familiarity with workflow orchestration tools/languages to automate tasks and deploy pipelines