
About the Client

Software Engineer - Data Engineering
Coinbase
United States

Software Engineer - Data Engineering

  • Price per hour
  • Deadline (not set)
  • Category: Technology
San Francisco, CA

Our vision is to bring more innovation, efficiency, and equality of opportunity to the world by creating an open financial system. Our first step on that journey is making digital currency accessible and approachable for everyone.

Two principles guide our efforts. First, be the most trusted company in our domain. Second, create user-focused products that are easier and more delightful to use than any alternative. Those principles guide every decision across the company, from design through engineering and from operations through security.

One key ingredient for making informed decisions is reliable and timely access to data, and that's where you come in. You will get the chance to build our next generation of data and machine learning pipelines and scoring systems from the ground up. Our data pipeline moves several terabytes of data from our production database (Mongo) to our analytical database (Redshift). We use machine learning to detect a variety of bad actors on our platform, including payment fraudsters, users who are risky from a compliance perspective, and users providing fake IDs.

Responsibilities

  • ETL pipeline: Maintain and build our next-generation Extract, Transform, Load (ETL) pipeline. Your specific challenge would be to build this for both scale (handle 10x data) and speed (ensure 1 minute or less of lag time).
  • ML pipeline: Redesign our Machine Learning (ML) pipeline using Apache Spark.
  • Deep learning pipeline: Build a deep learning pipeline for image classification tasks such as detecting fake and photoshopped IDs.
  • ML scoring: Build a microservice that returns a user's risk score in 100 ms or less.

Requirements

  • Exhibit our core cultural values: add positive energy, communicate clearly, be curious, and be a builder
  • Experience building at least one big-data pipeline in production
  • Deep knowledge of at least one big-data database, e.g., Spark, Hadoop, HBase, Cassandra, DynamoDB
  • Experience building microservices

Preferred (not required)

  • Computer Science or related engineering degree
  • Experience with Machine Learning

What to send

A resume and a link to your GitHub or blog post showcasing something awesome you've built


Software Engineering
Software Development