Procal is looking for a savvy Machine Learning & Data Engineer to join our team of analytics experts and help us extract value from our data. You will lead the full process, from data collection, cleaning, and preprocessing to training models and deploying them to production. We are looking for hands-on engineers with solid experience in big data, data architecture, machine learning, and large language models (LLMs). The ideal candidate is passionate about artificial intelligence and stays up to date with the latest developments in the field. This position combines typical Data Scientist skills with research, advanced business communication, and presentation skills.
Job Overview:
- Job ID: PROCLT0015
- Job Title: Data Engineer
- No. of Positions: 3
- Location: Basking Ridge, NJ
- Client: VZ
- Experience: 7 to 10 years
- Working Model: Onsite, Hybrid
- Job Type: Contract Position
Key Responsibilities:
- Develop scalable big data solutions using Hadoop, Hive, Spark, MapReduce, Java, and Python.
- Design schemas for NoSQL databases and data warehouses, and develop ETL data flows and cloud integrations for reporting solutions.
- Assemble large, complex data sets that meet functional and non-functional requirements, and identify process improvements by automating manual processes and optimizing data delivery.
- Build infrastructure for optimal extraction, transformation, and loading of data using SQL and Spark technologies.
- Gather, analyze, and draw conclusions from large, diverse data sets to support secure, stable application development, and verify data quality.
- Understand business objectives, develop models to achieve them, and track their progress.
- Design, develop, and research Machine Learning systems, models, and schemes, while managing available resources to meet deadlines.
- Perform statistical analysis to improve models, train ML systems as needed, and analyze ML algorithm use cases.
Key Skill Sets:
- Good communication and presentation skills with teamwork experience.
- Proficiency in R and/or Python, with experience in deep learning frameworks (e.g., TensorFlow or Keras) and machine learning and data analysis libraries (e.g., scikit-learn, pandas).
- Hands-on experience in Java, Scala, and/or Python, with system design and application development.
- Experience across the software development life cycle, and familiarity with agile methodologies and CI/CD practices.
- Knowledge of Unix shell, SQL, NoSQL databases, Linux, Spark, and Kafka.
- Understanding of Large Language Models from a systems engineering perspective.
Qualifications:
- MS or PhD in a relevant field (e.g., Computer Science, Engineering, Statistics, Physics, Applied Math).
- 5+ years of experience with Python for data analysis and model training/deployment.
- 3+ years of experience with ML frameworks (e.g., PyTorch, TensorFlow) and machine learning/statistical modeling.
- 1+ year of experience with technologies related to large language models, including LLM architectures and model evaluation.
- Proficiency in the design and deployment of LLM-powered agents and tools, including prompt engineering and retrieval optimization approaches.
How to Apply:
Interested candidates are invited to submit their resume and cover letter to careers@procaltech.com. Procal Technologies Inc. is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.