You’ve deployed an agent that helps users troubleshoot technical issues with their devices. After several weeks in production, user feedback indicates a decline in response accuracy, especially for newer issues.
Which monitoring method is most appropriate for identifying the root cause of declining agent performance?
When evaluating a customer service agent’s resilience to API failures and network issues, which analysis methods effectively identify weaknesses in error handling and retry mechanisms? (Choose two.)
An AI agent is being built to execute database queries, generate reports, and interact with cloud services.
Which design choice best improves long-term scalability and maintainability when adding new tools?
After a series of adjustments in a supply chain agentic system, the agent has dramatically reduced shipping times and minimized costs, but the team is receiving a high volume of complaints from customers regarding delayed deliveries.
Which metric is MOST important to prioritize when investigating this situation?
A recently deployed Agentic AI system designed for automated incident response within a cloud infrastructure has been consistently failing to identify and resolve ‘high-priority’ alerts – specifically, those related to increased CPU utilization across several virtual machines. Initial logs show the agent is primarily focusing on alerts with related network traffic spikes, ignoring the CPU metrics.
What is the most appropriate initial step for a senior Agentic AI engineer to take to resolve this issue, considering the system’s reliance on benchmarking and iterative improvement?
You’re developing an agent that monitors social media mentions of your brand. The social media platform’s API returns data mentioning your brand with varying confidence scores that the brand was actually being mentioned, but these scores aren’t consistently calibrated.
Considering the unreliability of these confidence scores, what’s the most reliable way for the agent to insure it is truly processing media mentions of the brand?
A customer service agent sometimes fails to complete multi-step workflows when APIs respond slowly or inconsistently.
Which approach most effectively increases robustness when working with unreliable APIs?
An AI engineer at an oil and gas company is designing a multi-agent AI system to support drilling operations. Different agents are responsible for subsurface modeling, risk analysis, and resource allocation. These agents must share operational context, reason through interdependent planning steps, and justify their collaborative decisions using structured, transparent logic. The architecture must support memory persistence, sequential decision-making and chain-of-thought prompting across agents.
Which implementation best supports this design?
When analyzing user feedback patterns to improve a technical documentation agent, which evaluation methods effectively translate feedback into actionable optimization strategies? (Choose two.)