Data Engineer

Engineering · REMOTE, Connecticut
Department Engineering
Employment Type Full-Time
Minimum Experience Experienced

About Catalytic Data Science (CDS):

Catalytic Data Science is a groundbreaking cloud R&D platform designed to integrate the volumes of scientific resources, data, and analytic tools while providing the ability to network with colleagues in one secure and scalable environment. By enabling R&D teams to work more collaboratively and improving productivity company-wide, the Catalytic platform helps teams achieve key R&D milestones faster and with greater accuracy. Our customers are passionate about making the world a better place, and we are inspired by the opportunity to help them.

 

The Role: 

You are a Data Engineer with experience processing terabytes of data. You have experience creating and automating scalable, fault-tolerant, and reproducible data pipelines using AWS technologies. You are interested in helping to create a platform built entirely on AWS. You are eager to join a team of Life Scientists and Software Engineers who believe the brightest minds in research should have the best tools to drive innovation.

 

 

What You’ll Do:

 

  • Build & operate automated ETL pipelines that process terabytes of text data nightly
  • Develop service frontends around our various backend datastores (Amazon Aurora MySQL, Elasticsearch, S3)
  • Work with our product team on technical analysis and requirements specification for data service integrations
  • Help customers bring their data to the platform

 

What You Know:

 

Must-Haves:

 

  • Python 3 or Java programming experience, preferably both
  • Day-to-day experience using AWS technologies such as Lambda, ECS Fargate, SQS, and SNS
  • Experience building and operating cloud-native data pipelines
  • Experience extracting, processing, storing, and querying petabyte-scale datasets
  • Familiarity with building and using containers
  • Familiarity with event-based microservices

 

Nice-to-Haves:

 

  • Prior experience with Elasticsearch (custom development and/or administration) is a huge plus
  • Prior work with text and natural-language processing
  • Knowledge of graph databases

 

What do we love in team members?

 

Your specialization is less important than your ability to learn fast and adapt to shifting technologies. We’re especially fond of people who:

 

  • Focus on customers’ needs and our company’s goals, not just writing code
  • Iterate until customers love what you’ve built
  • Self-start and initiate
  • Self-organize
  • Strive to grow personally and professionally, beyond just expanding technical abilities
  • Love to experiment with new technology and share knowledge with the team

 

 

In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification form upon hire.

 
