Associate Data Engineer (Remote, US)
Apply NowLocation:
US
Company:
Sayari is the transparency company providing immediate visibility into complex commercial relationships through global ownership and trade data.
Summary:
The Associate Data Engineer will build and maintain ETL pipelines and improve data processing for Sayari’s Graph application. Applicants need expertise in programming, SQL databases, and experience in ETL pipeline management.
Requirements:
Hard Skills: Python, Java, Scala, SQL, ETL Pipelines
Job Description:
POSITION DESCRIPTIONSayari provides instant access to structured business information from hundreds of millions of corporate, legal, and trade records for a variety of use cases. As a member of Sayari's data team you will work with our Product and Software Engineering to build the graph that underlies Sayari’s products.
Please note that we cannot provide H1B and/or Visa Sponsorship for this role at this time.
Job Responsibilities
- Build and maintain ETL pipelines to process and export record data to Sayari Graph application
- Develop and improve entity resolution processes
- Implement logic to calculate and export risk information
- Work with product team and other development teams to collect and refine requirements
- Run and maintain regular data releases
Required Skills & Experience
- Expertise with Python or a JVM programming language (e.g. Java, Scala)
- Expertise with SQL (e.g., Postgres) databases
- 0-2+ years of experience designing, maintaining, and orchestrating ETL pipelines (e.g., Apache Spark, Apache Airflow) in cloud based environments (e.g., GCP, AWS, or Azure).
Additional Preferred Skills & Experiences
- Experience with entity resolution, graph theory, and/or distributed computing
- Experience with Kubernetes
- Experience working as part of an agile development team using Scrum, Kanban, or similar