Data QA Engineer
About the position
Company Description: Ex Parte provides our customers with the data and insight to make smart and informed decisions on the most important legal issues facing their organizations. We are looking for talented, enthusiastic senior data engineers who share our passion for big data, AI, and machine learning and are excited by seemingly-impossible challenges. As an early employee, you must be amazingly entrepreneurial and thrive in a fast-paced environment where the solutions aren’t predefined. Every year, corporations spend more than $250B on litigation in the United States alone. And yet, critical decisions such as whether to litigate or settle, or where to file suit or which attorney to hire, are all made the same way they were 100 years ago. We are applying artificial intelligence, machine learning, and natural language processing to provide our customers with the insight they need to make highly informed decisions and gain a winning advantage.
Responsibilities
• Take ownership of end to end data quality
• Understand and contribute to the event model design
• Build and automate testing frameworks around data ingestion pipelines
• Write complex SQL queries on tables with hundreds of millions of records and ensure data integrity is maintained throughout the ETL lifecycle
• Design test cases and write Python/SQL scripts to validate data integrity and identify gaps and opportunities in our pipelines
• Track data issues and work with team leads from discovery to resolution
• Collaborate with analytic teams to conduct data quality investigations, improve automation and tools
• Review current tools and enhance them to help with data integrity
Requirements
Minimum Qualifications:
• 5+ years of work experience in QA, preferably in data or relevant space
• Demonstrable knowledge, experience, skill, and proficiency with Scrum/Agile methodologies and SDLC
• Python (at least reading) and SQL
• Experience with QA tests such as functional progression & regression, integration, performance, load, UAT, and operational readiness testing
• Must be self-motivated, able to work independently, and thrive in a fast-paced, multi-tasking environment while maintaining excellent working relationships across functions
• Excellent verbal and written communication skills
Nice-to-haves
Preferred Qualifications:
• Applied experience with Databricks and/or Azure ML
• Strong coding abilities in one or more scripting languages like Python or SQL
• Understanding of compliance, security, and risk domains and associated data elements
• Experience with vendor reporting solutions such as PowerBI or Tableau
• Understanding of product and services activation, use, and transaction models and data
• Understanding of statistical analysis and machine learning tools and practices
• Familiarity with cloud-centric data processing and visualization approaches including SQL and NoSQL databases (Azure SQL, Azure Cosmos DB, Data Factory, Synapse, Azure Data Lake, etc.)
• Familiarity with Agile software delivery and application lifecycle management tools (Jira/Azure DevOps/VSTS, Git)
Additional Information: All your information will be kept confidential according to EEO guidelines.