Data Engineer
Job Description
Emerge is seeking a Data Engineer for our client, a technology company with a public safety mission.
The Data Engineer will help to build and maintain our client's data infrastructure to support reporting, analytics, and data science. The right candidate will have strong data architecture, ETL, and SQL skills and a proven track record partnering with both the business and data team colleagues to construct a framework which delivers on all of their needs. The candidate will also have strong operational skills to drive efficiency and speed, expertise in building repeatable data engineering processes, strong project management skills, and a vision for how to deliver data products.
Position Responsibilities:
- Create and orchestrate data pipelines in state-of-the-art AWS environment (Redshift, EC2, EMR, S3, Lambda, etc. with AWS Glue)
- Manage all aspects of the data and analytics system from ingestion to ETL to aggregate tables for analytics and reporting needs
- Design and implement internal data pipeline jobs / process improvements using various modern techniques: automating manual processes, optimizing data delivery, re-designing infrastructure for scalability, etc.
- Write scripts to schedule data ingestion and syncing. Evaluate, lead and form backend logic to create data marts from requirements for the purposes of self-serving stakeholders
- Assist data architects to build the working framework to create metrics as code and optimize data search and retrieval
- Troubleshoot data jobs / processes in production to fix data quality bugs or pipeline performance issues
- Build and assemble large, complex data sets with multi-dimensional relationships that meet both functional and non-functional requirements from business stakeholders
- Build integrations between systems
- Bachelor's degree or higher in a quantitative/technical field (e.g., Computer Science, Statistics, Engineering) or equivalent experience
- 5+ years of relevant experience in data engineering, database engineering, business intelligence or business analytics
- 5+ years of SQL knowledge for various reporting and transformation needs (Redshift, MySQL, PostgreSQL, Snowflake, Databricks)
- 4+ years of experience in core languages such as Python (experience building classes preferred)
- 4+ years of experience with schema design and dimensional data modeling
- An interest in understanding the business and its strategy, not just the data architecture, and how our work contributes to meeting business goals
- Experience with Data Lake architectures, and with combining structured and unstructured data into unified representations.
- Experience with API design and development of RESTful web services
- Analytical mindset with the ability to structure and process qualitative data and draw insightful conclusions.
- Data science and machine learning experience a plus
- GIS experience a plus
- Data visualization experience with tools like Tableau a plus
- Experience with JIRA preferred
- Must be an intellectually curious self-starter and motivated to continually learn
- Fluent in English; read, write and speak
- Remote, work from home, based in Mexico
- Monday-Friday work week and must work Eastern US office hours
- Applicants must reside in Mexico and be authorized to work in Mexico