Back To All Jobs Software Engineer - Data Engineering
San Francisco, CA
Our vision is to bring more innovation, efficiency, and equality of opportunity to the world by creating an open financial system. Our first step on that journey is making digital currency accessible and approachable for everyone.
Two principles guide our efforts. First, be the most trusted company in our domain. Second, create user-focused products that are easier and more delightful to use than any alternative. Those principles guide every decision across the company from design through engineering, from operations through security. One key ingredient for making informed decisions is reliable and timely access to data and that's where you come in.
You will get the chance to build our next generation of data and machine learning pipelines and scoring systems from the ground-up. Our data pipeline moves several terabytes of data from our production database (Mongo) to analytical database (Redshift). We use machine learning to detect a variety of bad-actors on our platform including payment-fraudsters, risky users from a compliance perspective, users providing fake IDs, etc.
ETL pipeline : Maintain and build our next generation Extract Transform Load (ETL) pipeline. Your specific challenge would be to build this for both scale (handle 10x data) and speed (ensure 1 minute or less of lag time)
ML pipeline : Redesign our Machine Learning (ML) pipeline using Apache Spark
Deep Learning pipeline: Build a deep learning pipeline for image classification tasks like detecting fake and photoshopped IDs
ML scoring: Build a micro-service that allows us to get a user's risk score in 100 msec or less
Exhibit our core cultural values: add positive energy, communicate clearly, be curious, and be a builder
Experience building at least one big-data pipeline in production
Deep knowledge of at least one of the following big-data databases e.g., Spark, Hadoop, Hbase, Cassandra, DynamoDB
Experience building micro-services
Preferred (not required):
Computer Science or related engineering degree
Experience with Machine Learning a plus, but not required
What to send
A resume and a link to your GitHub or blog post showcasing something awesome you've built