We at Procal are looking for a savvy Machine Learning & Data Engineer to join our team of analytics experts to help us extract value from our data. You will lead all processes from data collection, cleaning, and preprocessing to training models and deploying them to production. We are looking for very hands-on engineers with strong experience in big data, data architecture, machine learning, and large language models (LLM). The ideal candidate will be passionate about artificial intelligence and stay up to date with the latest developments in the field. This position will combine typical Data Scientist math and analytical skills with research, advanced business communication, and presentation skills.
Job ID: PROCLT0015
Job Title: Data Engineer
No. of Positions: 3
Location: Basking Ridge, NJ
Client: VZ
Experience: 7 to 10 years
Working Model: Onsite, Hybrid
Job Type: Contract Position
Key Responsibilities:
- Develop scalable big data solutions using Hadoop, Hive, Spark, Map-Reduce, Java, and Python.
- Design schema and data modeling for NoSQL databases and Data Warehouses.
- Develop ETL data flows and cloud integration to build reporting solutions.
- Assemble large, complex data sets to meet functional and non-functional requirements.
- Identify and implement internal process improvements, automating manual processes, optimizing data delivery, and redesigning infrastructure for scalability.
- Build infrastructure for optimal extraction, transformation, and loading of data from diverse sources using SQL and Spark technologies.
- Gather, analyze, and draw conclusions from large, diverse data sets to contribute to decision-making.
- Verify data quality and perform data cleaning, exploring and visualizing data for performance insights.
- Understand business objectives and develop models to achieve them, tracking progress with metrics.
- Design, develop, and research Machine Learning systems, models, and schemes.
- Perform statistical analysis to improve models and train/retrain ML systems as needed.
- Analyze use cases of ML algorithms.
- Build efficient pipelines to host LLM services for daily analysis and troubleshooting.
Key Skill Sets:
- Good communication and presentation skills; team player.
- Experience in R and/or Python; proficiency in deep learning frameworks such as TensorFlow or Keras.
- Expertise in visualizing and manipulating big datasets with strong Python libraries (e.g., scikit-learn, pandas).
- Hands-on experience in Java, Scala, and/or Python, as well as system design and application development.
- Familiarity with the entire Software Development Life Cycle and exposure to agile methodologies such as CI/CD.
- Knowledge of Unix shell, SQL, NoSQL databases, Linux, Spark, and Kafka.
- Understanding of Large Language Models from a system engineering perspective.
Qualifications:
- MS or PhD in a relevant field (Computer Science, Engineering, Statistics, Physics, Applied Math).
- 5+ years of experience with Python for analyzing datasets, training, evaluating, deploying, and optimizing models.
- 3+ years using ML frameworks such as PyTorch or TensorFlow.
- 1+ year of experience with LLM technologies and model evaluation, customization, and fine-tuning techniques.
- Adept at designing, deploying, and evaluating LLM-powered agents and prompt engineering.
- Master's degree in a quantitative field (statistics, data science, business analytics, economics, finance, engineering, or computer science).
How to Apply: Interested candidates are invited to submit their resume & cover letter to careers@procaltech.com.
Procal Technologies Inc is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.