Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.
This is a US-only, Remote role (Mainland).
Lead data infrastructure architect managing billions of data points across 250M+ professional profiles.
Hire data engineers to aid you in that journey.
Design scalable data pipelines processing massive record volumes
Architect ETL processes using PySpark on Amazon EMR (open to shifting to other solutions such as Databricks or Snowflake)
Distribute enriched data through medallion architecture across Postgres, Athena, OpenSearch
Integrate new data sources into the main pipeline
Implement advanced data matching using Splink
5-8 years of professional data engineering experience
Good proficiency in:
PySpark and distributed computing
AWS data services (EMR, Glue, Athena)
Docker
Pandas and DataFrame manipulation
Complex data format handling (JSONL, Parquet)
Strong background in:
Big data processing architectures
Data warehouse design
Performance optimization
Advanced Python and SQL skills
Probabilistic record linking expertise
OpenSearch/Elasticsearch technologies
Machine learning data pipeline design
Recruitment tech ecosystem knowledge
Big Data: PySpark, EMR
Databases: Postgres, OpenSearch
Cloud: AWS
Containerization: Docker
Data Formats: JSONL, Parquet
Analytics: Metabase, Athena, Glue
Data Processing: Pandas, Splink
This role has specific requirements, but if you lack a few technical skills and are motivated to learn and lead the platform, please apply for consideration.
If you are coming from a Director, Head of, or VP level role that is relevant to this job, you can apply as well.
You will need to apply directly on our platform.
Thank you for your time.