Contact Us

Senior Data Engineer

Job Title
Senior Data Engineer
Job ID
27628581
Work From Home
Yes
Location
Remote, 
Other Location
Description

Who We Are

M2GEN is seeking people who are passionate about making a difference in the lives of cancer patients and who want to join a science-focused and purpose-driven team. In order to be a leader in creating and delivering health informatics solutions through evidence-based approaches to predict and meet the needs of cancer patients, we need an all-star team. Be part of the cure by joining M2GEN's team and impacting the future of cancer care.

Role / What you will be doing

Do you have a passion for big data and an interest in being on the forefront of precision medicine? Do you want to actively participate in the fight against cancer? If so, come join us to help scale and modernize M2GEN’s data platform, which will help research hospitals, pharmaceutical and bio-technology companies combat cancer. You’ll partner across engineering, architecture and the clinical and bioinformatics teams to deliver the best-in-class platform. You are a leader that continuously strives to raise the bar for the engineering organization.

Responsibilities (this is not an all-inclusive list; duties may evolve over time as business needs change)

● In partnership with other Senior and Principal engineers, lead the design of the next-generation precision-medicine data platform and products to help researchers win the fight against cancer. Partner with the business and other technology leaders to do so with an open mind and avoid dogma.

● Hands-on engineering of the platform, modeling what great code and teamwork looks like, in partnership with other engineers and teams.

● Foster a culture of support and growth through coaching/mentoring teammates, documenting your work, help teams break down complex systems and projects to understandable components, and actively participate in design and code reviews.

● Understand our customers and our business needs by collaborating with Product Managers, Clinical and Molecular Data Scientists, and other Engineering teams to actively figure out what is required to add value and help own the end-to-end solution with an open mind.

● Create an environment of inclusive excellence, psychological safety, compassionate directness and continuous improvement.

Education / Experience

● 6+ years minimum experience in Software Engineering, Design or Development.

● 4 or more years of experience delivering Data Lake/Warehousing solutions using high-scale distributed system tools and techniques.

● Experience developing software in one or more programming languages (Python, R, Scala/Java, Go, JavaScript/TypeScript, C#, etc) and comfort in a multi-language/polyglot environment.

● 3 or more years of delivering cloud-based software solutions, in AWS, Azure, Google Cloud Platform or other. Azure experience is a plus.

Knowledge / Skills / Abilities

● Strong engineering with a passion for delivering software designs and implementations in partnership with the team that are reliable, well-tested, high-performance, high-value and high-impact.

● Experience with modern data pipelining systems such as Apache AirFlow, Azure Synapse, AWS Data Pipeline, Databricks Delta, etc. Azure experience preferred.

● Experience with various databases and types in the industry including MPP (Snowflake, Azure Synapse, Redshift, BigQuery, etc), traditional SQL (PostgreSQL,MariaDB,Azure SQL, etc) NoSQL databases (DynamoDB, CosmoDB, Cassandra, etc), Data Warehouse design, BI reporting and dashboard development. Azure-based data management systems preferred.

● Experience integrating with and customizing analytics visualization systems, such as PowerBI, Tableau or Qlik. Experience with cBioPortal a plus.

● Experience delivering in compliance and regulatory environments that require high data security and a focus on data privacy. Specific knowledge in managing PII/PHI and HIPAA/HiTrust a plus.

● Demonstrated experience with modern software development concepts including operations-first delivery via DevOps/GitOps, automated testing/CI/CD pipelines, zero-downtime deployments, and SCRUM/Kanban methodologies.

● Foster an environment that encourages autonomy and psychological safety.

● Enjoy mentoring, coaching and delivering compassionate, direct feedback to team members as well as receiving and acting on feedback from others.

● Understand the value of diverse teams, and champion practices for Diversity, Equity and Inclusion.

● A passion for continuous learning, curiosity, personal accountability, teamwork, strong problem-solving skills, critical thinking and good judgement.

● Experience with data quality/diagnostics and lineage systems (Apache Atlas/Ranger/Griffin, Deequ, Azure Purview, etc) is preferred.

● Machine Learning processes and platforms info a plus (model training/serving, model management, feature store engineering, etc).

● Interest in Oncology is also a plus.

Our Values

Put Patients First– because they inspire us

Be Bold – be proud but not satisfied, be trailblazers

Create Knowledge – transform data into wisdom through shared information and team science

Lead by Example – Support and encourage others to work and grow in a way that brings out their best

Join the Conversation – Collaborate openly and honestly. Build community and relationships that last

Option 1: Create a New Profile