-
Notifications
You must be signed in to change notification settings - Fork 1.3k
Pull requests: confident-ai/deepeval
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add penalize_ambiguous_claims to AnswerRelevancyMetric
#2573
opened Mar 25, 2026 by
Krishnachaitanyakc
Loading…
3 tasks
fix(gemini): fix log_probs detection, temperature guard, and add Gemini 3.x models
#2572
opened Mar 23, 2026 by
GerardoYalo
Loading…
Add test_case_id support to ConfidentSpanExporter
#2570
opened Mar 22, 2026 by
AlexMaggioni
Loading…
1 task done
fix(ragas): update capture_metric_type call for new telemetry signature
#2568
opened Mar 22, 2026 by
sachinML
Loading…
Add GoodMem integration for memory-powered retrieval
#2566
opened Mar 19, 2026 by
bassammalik
Loading…
4 of 5 tasks
Fix: Parse CSV Single-Turn Golden ToolCalls as JSON objects
#2565
opened Mar 19, 2026 by
seankelley-dt
Loading…
fix: include tool and trace state in evaluation cache keys
#2561
opened Mar 19, 2026 by
aerosta
Loading…
fix: preserve metric snapshots when async metric tasks fail in indicator
#2560
opened Mar 18, 2026 by
aerosta
Loading…
Fix NoneType crash in trimAndLoadJson when LLM returns None
#2558
opened Mar 18, 2026 by
joaquinhuigomez
Loading…
feat: add native Groq model integration for high-speed evaluations
#2556
opened Mar 17, 2026 by
Jayachander123
Loading…
4 tasks done
fix: avoid division by zero in GEval score normalization
#2541
opened Mar 8, 2026 by
aerosta
Loading…
fix: prevent double-wrapping in KnowledgeRetentionMetric extraction
#2536
opened Mar 6, 2026 by
koriyoshi2041
Loading…
1 task done
feat: add user-defined prompt_builder to Synthesizer
#2519
opened Mar 1, 2026 by
rahulmansharamani14
Loading…
Add MLLMDocument to support PDF inputs in LLMTestCase
#2516
opened Feb 28, 2026 by
Fizza-Mukhtar
Loading…
fix: actual_output and tools_called are None when using ConfidentInstrumentationSettings
#2515
opened Feb 27, 2026 by
Fizza-Mukhtar
Loading…
tests(docs): add component-level eval tracing and dataset IO tests
#2433
opened Jan 14, 2026 by
BloggerBust
Loading…
feat: add DeepEval + E2B sandbox SWE evaluation pipeline example
#2430
opened Jan 12, 2026 by
Ayush7614
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.