Lead Data Engineer

Contract Type: Direct Hire

Job Location: New York, New York

A company based in New York is looking for a Lead Data Engineer. You will team up with talented architects and developers, and work closely with application developers to align our data strategy with business requirements.

What we're looking for:

You are a software developer with extensive experience, particularly in data modeling, distributed computing, and ETL processes. You will work with data lakes, warehouses, and a variety of databases, and you need to know how to orchestrate and automate big data processes.

Job Qualifications:

  • At least four years of professional software development experience.
  • Strong fundamentals in computer science.
  • Expertise in software development in a server-side language (Python, Java, Scala, etc.).
  • Experience with NoSQL databases and with designing relational schemas, plus in-depth knowledge of data storage formats (JSON, Parquet, Avro, etc.).
  • Skilled in indexing and tuning databases, including column-based systems.
  • Expert in SQL.
  • Experience working with distributed computing frameworks such as Spark and Hadoop.
  • Comfortable taking charge of your own cloud resources.
  • Experience designing ETL pipelines and data infrastructure.

Preferred Qualifications:

  • Python is your preferred programming language.
  • You have extensive experience with PySpark.
  • Knowledge of entity resolution and graph theory, as well as a background in data science.
  • You're interested in application development.
  • Familiarity with AWS.

Equal Opportunity Employer/Veterans/Disabled

    To read our Candidate Privacy Information Statement, which explains how we will use your information, please navigate to https://www.lhh.com/us/en/candidate-privacy

    The Company will consider qualified applicants with arrest and conviction records.

    Ref: US_EN_27_918942_2933787

    LHH

    25 days ago
