cover image
IT Minds LLC

AI Optimization Engineer || NYC, NY(Onsite) ||

On site

New york, United states

Freelance

16-02-2026

Share this job:

Skills

Python JavaScript HTML CSS Data Analysis SQL Data Cleaning MySQL MongoDB GitHub Docker Kubernetes Jenkins Architecture Linux System Administration Machine Learning PyTorch Scikit-Learn TensorFlow Deep Learning Computer Vision Regression Programming angular AWS Numpy PL/SQL Flask Redis Large Language Models Keras Terraform Prometheus Grafana Matplotlib seaborn Microservices NLP

Job Specifications

Title: AI Optimization Engineer

Duration: 6 Months

Location: NYC, NY(Onsite)

Long Term Contract

Only W2 (USC OR GC)

Qualifications

Proficiency in languages such as Python, with experience in libraries like NumPy and scikit-learn.

Knowledge of various machine learning algorithms, including supervised and unsupervised learning, neural networks, decision trees, clustering, and dimensionality reduction.

Experience with deep learning frameworks such as TensorFlow, PyTorch, or Keras, and knowledge of their architectures and APIs.

Proficient with SLURM workload manager with REST and Flask APIs for automated and secure job scheduling.

Experienced in scalable infrastructure for deploying and managing large language models (LLMs),

HPC engineer with hands-on experience designing and managing GPU-accelerated clusters for large-scale AI/ML workloads.

Experience with deploying machine learning models in production environments, including containerization, microservices, and API design.

Leveraging Prometheus and Grafana to collect and analyze metrics, identify performance issues, and implement fixes. Experience creating Slurm and Triton metrics will be a plus.

Familiarity with Triton Inference Server, including its architecture, configuration, and deployment.

Knowledge of model optimization techniques, including pruning, quantization, and knowledge distillation.

Exploratory Data Analysis - Plotly, Seaborn, matplotlib

Deep Learning, Neural Networks, Decision Trees, Ensemble Methods, Gradient Boosting, Support Vector Machines, Random Forest, Logistic Regression, Transfer learning, Transformer based models, BART, Hyperparameter Tuning, Gen-AI, CNN, Computer Vision, NLP

Tools and Platforms like - Docker, Kubernetes, Jupyter, MLFlow, Github, Terraform, Jenkins, HuggingFace

Flask API Development and Security

Container Runtimes: Enroot, Pyxis, Podman

Linux (RHEL/CentOS) System Administration

Model Optimization techniques using Triton with TRTLLM

Desired Qualifications

Experience with data cleaning, feature scaling, and normalization

Programming skills creating UI/UX using the Angular framework, HTML, CSS, and JavaScript

Creating vector embeddings

Tools and Platforms like - AWS (SageMaker, Lambda, EC2)

Database Technologies – Oracle, MS-SQL, MongoDB, Redis and MySQL

SQL and PL/SQL Scripting

karthik@itminds.net

About the Company

IT Minds LLC provides the resources for long and short-term contracts. It also offers various product services. Information Technology , Pharmaceutical , Regulatory Affairs and Health Care Staffing, Product Development, On-site Customer Services, etc. are some of our services. We have over 15 years of experience placing information technology professionals in permanent positions and consulting assignments. At any level, on any platform, we provide quality professionals in quality positions. We always look forward to long-... Know more