Data Engineer Job at VDart Inc, Remote

MTEraHhkeGl3V1BLak1FZkphZVA1di9uNUE9PQ==
  • VDart Inc
  • Remote

Job Description

Title: Data Engineer

Location: Remote

Duration: 6 Months

Work Description:

We are in the process of migrating off CRMA Data Manager by rewriting queries and implementing the required data transformations in AWS. This platform modernization effort includes working through a backlog of datasets that must be migrated to AWS and transformed to meet current and future reporting needs.

Business Knowledge:

Limited business knowledge is needed.

Technical Skills:

Must-Have Technical Skills:

  • AWS Data Services (Hands-on)
  • S3: Data lake design, partitioning strategies, lifecycle management
  • IAM: Roles & policies, least-privilege access, cross-account access
  • Glue / EMR: Crawlers, Data Catalog, ETL job development
  • Athena: Querying data lakes with performance and cost optimization
  • Lake Formation: Basic governance and permission management

Compute & Processing

  • Apache Spark (PySpark): Batch processing, performance tuning, joins, partitioning
  • Python: Production-grade coding (packaging, testing, logging, type hints)
  • SQL: Advanced querying (window functions, query optimization, data modeling support)

Orchestration & Scheduling

  • Airflow / MWAA / AWS Step Functions
  • DAG design
  • Retry mechanisms
  • SLA management
  • Backfills
  • Data Warehousing & Modeling
  • Redshift / Snowflake (on AWS): Fundamentals and performance considerations
  • Dimensional Modeling: Star/Snowflake schema design

ETL/ELT Patterns:

  • CDC (Change Data Capture)
  • SCD (Slowly Changing Dimensions)
  • Idempotent data pipelines
  • Data Reliability & Observability
  • Data quality frameworks: Great Expectations / Deequ (or equivalent)
  • Data reconciliation & validation
  • Monitoring & observability: CloudWatch logs, metrics, alerts

DevOps & Delivery

  • Version Control: Git, branching strategies, code reviews
  • CI/CD: Data pipeline automation (e.g., GitLab CI/CD)
  • Infrastructure-as-Code: OpenTofu / CloudFormation for AWS resource deployment

Security & Compliance

  • Encryption: At rest & in transit (KMS)
  • Secrets management: AWS Secrets Manager / SSM
  • Networking fundamentals: VPC, private subnets, endpoints (data access control)

Role Expectations (Hands-on Experience Required):

  • Designed, developed, and maintained production-grade ETL pipelines using AWS Glue (PySpark)
  • Built scalable data ingestion pipelines from S3, databases, and streaming sources into S3 data lakes
  • Implemented complex transformations and joins in PySpark, optimizing performance (partitioning, broadcast joins, caching)
  • Developed incremental and idempotent pipelines, including handling CDC and SCD
  • Automated schema discovery using Glue Crawlers and Data Catalog
  • Tuned Glue Spark jobs for performance, concurrency, and cost efficiency
  • Integrated pipelines with orchestration tools like Airflow (MWAA) or Step Functions
  • Collaborated with data teams to load curated data into Redshift / Snowflake / Iceberg for analytics
  • Implemented data quality checks using built-in validations or tools like Great Expectations / Deequ
  • Applied AWS security best practices (IAM roles, KMS encryption, secure data access)
  • Contributed to CI/CD pipelines for Glue job deployment using Git and IaC tools
  • Monitored pipelines using CloudWatch, ensuring reliability and quick incident resolution
  • Worked closely with stakeholders to define data contracts, SLAs, and business expectations

Key Skills: Data Engineer, AWS Glue, IAM, ETL, Athena, PySpark

Job Tags

Full time

Similar Jobs

Naver U.Hub INC

ThingsBook - Creator & Content Specialist Job at Naver U.Hub INC

 ...benefiting from the stability and resources of a global technology leader. Key Responsibilities: We are looking for a Creator & Content Specialist to build and scale the content and creator ecosystem at ThingsBook. This role goes beyond traditional content or... 

Cornerstone School

School Secretary Job at Cornerstone School

 ...Cornerstone Mandarin Immersion Program is looking for good fit profession to joining our growing community. The School Secretary plays a vital role in the smooth operation of the school, providing administrative support to the principal, staff, students, and parents... 

American Construction Group, Inc

Pilot Car Driver Job at American Construction Group, Inc

 ...American Construction Group, Inc is hiring Pilot Car/Escort Drivers. Job Overview: Join our team as a Pilot Car Escort Driver and play a vital role in ensuring the safe and efficient transportation of oversized loads. In this position, you will lead the way for... 

Nation Security

Front Desk Receptionist - Bilingual English/ Spanish Job at Nation Security

 ...the Role Nation Security is seeking a Bilingual Front Desk Receptionist who is...  ...candidate is fluent in both English and Spanish, able to multitask efficiently, and thrives...  ...service. Key Responsibilities Greet and assist visitors, employees, and clients with... 

Confidential

MARKETING INTERN Job at Confidential

 ...creation, and community engagement. The intern works closely with local association leadership...  ...high-quality content (graphics, photos, videos, captions) and manage regional social...  ...teams. Capture video content of games and special events as assigned. Assist...