This page was automatically translated and may contain errors. View in English.
Virtusa

Model Validator - Agentic AI

Virtusa

Andhra Pradesh, India · ಪೂರ್ಣ ಸಮಯ

ಅರ್ಜಿ ಸಲ್ಲಿಸುವವರಲ್ಲಿ ಮೊದಲಿಗರಾಗಿರಿ

ಅನುಭವ
ಯಾವುದೇ
ಸಂಬಳ
ತೆರೆಯುವಿಕೆಗಳು
1
ಪೋಸ್ಟ್ ಮಾಡಲಾಗಿದೆ
1 ಗಂಟೆ ಹಿಂದೆ

Where you'll work

ಕೆಲಸದ ವಿವರ

Role Overview

This position is for a Model Validator working on agentic AI systems. The role focuses on designing evaluation approaches, stress-testing agent behavior, and verifying that model outputs and tool usage align with business requirements.

Key Responsibilities

  • Build evaluation sets that can benchmark agent performance by tracing reasoning paths.
  • Carry out adversarial tests by presenting conflicting or challenging instructions to expose weak spots in the agent.
  • Run regression checks to measure how much agent behavior changes across test cycles.
  • Validate tool calls to ensure the agent is invoking the right external APIs and databases.
  • Review thought chains and pinpoint where the agent’s logic starts to drift from the business requirement document.
  • Apply judge LLMs to score and assess model outputs.
  • Use semantic debugging by analyzing the agent’s thought trace to identify decision issues.

Skills and Technical Expectations

  • Strong Python programming ability.
  • Hands-on familiarity with evaluation frameworks such as DeepEval and LangSmith.
  • Solid understanding of data- and SQL-driven testing methods.
  • Ability to work with model traces, reasoning paths, and output grading workflows.

Screening Criteria

  • SDET profiles are preferred because their coding and testing background suits evaluation development.
  • Prior domain exposure is an added advantage for this role.

Location and Work Mode

This is a full-time onsite opportunity based in Andhra Pradesh, India.

ನಿಮಗೆ ಪ್ರತ್ಯುತ್ತರ ಬೇಕಾದರೆ ಅದನ್ನು ಬಿಡಿ — ನಾವು ಅದನ್ನು ಬೇರೆ ಯಾವುದಕ್ಕೂ ಬಳಸುವುದಿಲ್ಲ.

ಬ್ರೌಸ್ ಮಾಡಲು ಕ್ಲಿಕ್ ಮಾಡಿ, ಎಳೆಯಿರಿ ಮತ್ತು ಬಿಡಿ, ಅಥವಾ ಅಂಟಿಸಿ ಸ್ಕ್ರೀನ್‌ಶಾಟ್

PNG, JPG, GIF, MP4, WebM, MOV · ಪ್ರತಿಯೊಂದೂ ಗರಿಷ್ಠ 20MB · 5 ಫೈಲ್‌ಗಳವರೆಗೆ