iShare Inc.

Claim your share of IT Driven Value

AI Engineer – Agentic, RAG Systems

AI Engineer, Machine Learning Engineer, Full Time, Remote, Team 11-50, Since 2017, H1B No Sponsor

Location

United States

Posted

114 days ago

Salary

$130K / year

Bachelor Degree, English, Flask, Google Cloud Platform, Python, PyTorch, scikit-learn, TensorFlow

Job Description

  • Design, build, and operate agentic AI systems end-to-end, from concept to production.
  • Work on multi-agent orchestration, Retrieval-Augmented Generation (RAG), evaluation frameworks, and AI guardrails to build safe, reliable, and high-performing systems.
  • Collaborate cross-functionally with product, ML, and design teams, bringing ideas to life through strong engineering execution, clear communication, and a low-ego, problem-solving mindset.
  • Design and implement Retrieval-Augmented Generation pipelines to ground LLMs in enterprise or domain-specific data.
  • Make strategic decisions on chunking strategy, embedding models, and retrieval mechanisms to balance context precision, recall, and latency.
  • Work with vector databases (Qdrant, Weaviate, pgvector, Pinecone) and embedding frameworks (OpenAI, Hugging Face, Instructor, etc.).
  • Diagnose and iterate on challenges like chunk size trade-offs, retrieval quality, context window limits, and grounding accuracy, using structured evaluation and metrics.
  • Establish comprehensive evaluation frameworks for LLM applications, combining quantitative (BLEU, ROUGE, response time) and qualitative methods (human evaluation, LLM-as-a-judge, relevance, coherence, user satisfaction).
  • Implement continuous monitoring and automated regression testing using tools like LangSmith, LangFuse, Arize, or custom evaluation harnesses.
  • Identify and prevent quality degradation, hallucinations, or factual inconsistencies before production release.
  • Collaborate with design and product to define success metrics and user feedback loops for ongoing improvement.
  • Implement multi-layered guardrails across input validation, output filtering, prompt engineering, re-ranking, and abstention (“I don’t know”) strategies.
  • Use frameworks such as Guardrails AI, NeMo Guardrails, or Llama Guard to ensure compliance, safety, and brand integrity.
  • Build policy-driven safety systems for handling sensitive data, user content, and edge cases with clear escalation paths.
  • Design and operate multi-agent workflows using orchestration frameworks such as LangGraph, AutoGen, CrewAI, or Haystack.
  • Coordinate routing logic, task delegation, and parallel vs. sequential agent execution to handle complex reasoning or multi-step tasks.
  • Build observability and debugging tools for tracking agent interactions, performance, and cost optimization.
  • Evaluate trade-offs around latency, reliability, and scalability in production-grade multi-agent environments.

Job Requirements

  • Strong proficiency in Python (FastAPI, Flask, asyncio); GCP experience is good to have.
  • Demonstrated hands-on RAG implementation experience with specific tools, models, and evaluation metrics.
  • Practical knowledge of agentic frameworks (LangGraph, LangChain) and evaluation ecosystems (LangFuse, LangSmith).
  • Excellent communication skills, proven ability to collaborate cross-functionally, and a low-ego, ownership-driven work style.
  • Experience in traditional AI/ML workflows, e.g., model training, feature engineering, and deployment of ML models (scikit-learn, TensorFlow, PyTorch).
  • Familiarity with retrieval optimization, prompt tuning, and tool-use evaluation.
  • Background in observability and performance profiling for large-scale AI systems.
  • Understanding of security and privacy principles for AI systems (PII redaction, authentication/authorization, RBAC).
  • Exposure to enterprise chatbot systems, LLMOps pipelines, and continuous model evaluation in production.
