uqlm 0.6 documentation

Scorer Definitions

This section provides formal mathematical definitions for all uncertainty quantification scorers available in UQLM. Each scorer returns a confidence score between 0 and 1, where higher scores indicate a lower likelihood of errors or hallucinations.
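Because every score lies in [0, 1] and higher means more trustworthy, one common way to consume the scores is to flag responses that fall below a cutoff. The following is a minimal sketch of that idea, not part of the UQLM API: the function name `flag_low_confidence` and the default threshold of 0.5 are hypothetical choices for illustration, and the scores are made-up values.

```python
from typing import List, Sequence


def flag_low_confidence(scores: Sequence[float], threshold: float = 0.5) -> List[int]:
    """Return indices of responses whose confidence score is below `threshold`.

    Hypothetical helper (not a UQLM function). Assumes each score is a
    confidence in [0, 1], where higher means errors are less likely.
    """
    for s in scores:
        if not 0.0 <= s <= 1.0:
            raise ValueError(f"confidence score out of range [0, 1]: {s}")
    # A low score signals a higher likelihood of error or hallucination.
    return [i for i, s in enumerate(scores) if s < threshold]


# Illustrative (made-up) confidence scores for four responses.
example_scores = [0.92, 0.35, 0.78, 0.10]
flagged = flag_low_confidence(example_scores, threshold=0.5)
print(flagged)
```

An appropriate threshold depends on the scorer and the application, so in practice it would be tuned on labeled data rather than fixed at 0.5.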

For detailed API documentation and usage examples, see the API Reference and Example Notebooks.

Available Scorers

  • Black-Box Scorers
    • Normalized Semantic Negentropy
    • Semantic Sets Confidence
    • Non-Contradiction Probability
    • Entailment Probability
    • Exact Match Rate
    • BERTScore
    • Normalized Cosine Similarity
  • White-Box Scorers
    • Single-Generation Scorers
    • Multi-Generation Scorers
  • LLM-as-a-Judge Scorers
    • Ternary Judge (True/False/Uncertain)
    • Binary Judge (True/False)
    • Continuous Judge
    • Likert Scale Judge
    • Panel of Judges
  • Ensemble Scorers
    • BS Detector
    • Generalized Ensemble
  • Long-Text Scorers
    • Long-Text Scoring Methods
  • Code-Generation Scorers
    • Code-Generation Scoring Methods

© Copyright 2025, CVS Health.
