langcheck.metrics.metric_value

class langcheck.metrics.metric_value.MetricValue(metric_name: str, metric_values: List[NumericType], prompts: List[str] | None, generated_outputs: List[str] | tuple[List[str], List[str]] | None, reference_outputs: List[str] | None, sources: List[str] | tuple[List[str] | None, List[str] | None] | None, explanations: List[str | None] | None, language: str | None)

Bases: Generic[NumericType]

A rich object that is the output of any langcheck.metrics function.

all() -> bool

Equivalent to all(metric_value.metric_values). This is mostly useful for binary metric functions.

any() -> bool

Equivalent to any(metric_value.metric_values). This is mostly useful for binary metric functions.
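
As a rough illustration of the equivalence described for all() and any(), the sketch below applies the Python built-ins to a plain list. The `scores` list is a hypothetical stand-in for metric_values from a binary metric; no langcheck object is created.

```python
# Hypothetical stand-in for MetricValue.metric_values from a binary
# metric (1 = pass, 0 = fail). This is not a real langcheck object.
scores = [1, 1, 0, 1]

# all() is True only when every value is truthy.
all_pass = all(scores)

# any() is True when at least one value is truthy.
any_pass = any(scores)

print(all_pass, any_pass)
```

Because the built-ins test truthiness, a continuous score of exactly 0.0 counts as falsy, which is one reason these methods fit binary metrics best.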

explanations: List[str | None] | None
generated_outputs: List[str] | tuple[List[str], List[str]] | None
histogram(jupyter_mode: str = 'inline') -> None

Shows an interactive histogram of all data points in MetricValue. Intended to be used in a Jupyter notebook.

This is a convenience function that calls langcheck.plot.histogram().

property is_pairwise: bool
language: str | None
metric_name: str
metric_values: List[NumericType]
prompts: List[str] | None
reference_outputs: List[str] | None
scatter(jupyter_mode: str = 'inline') -> None

Shows an interactive scatter plot of all data points in MetricValue. Intended to be used in a Jupyter notebook.

This is a convenience function that calls langcheck.plot.scatter().

sources: List[str] | tuple[List[str] | None, List[str] | None] | None
to_df() -> DataFrame

Returns a DataFrame of metric values for each data point.

class langcheck.metrics.metric_value.MetricValueWithThreshold(metric_name: str, metric_values: List[NumericType], prompts: List[str] | None, generated_outputs: List[str] | tuple[List[str], List[str]] | None, reference_outputs: List[str] | None, sources: List[str] | tuple[List[str] | None, List[str] | None] | None, explanations: List[str | None] | None, language: str | None, threshold: float | int, threshold_op: str)

Bases: MetricValue

A rich object that is the output of comparing a MetricValue object against a threshold, e.g. metric_value >= 0.5.
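
The idea of a comparison that returns a rich object instead of a bare bool can be sketched with two small stand-in classes. `Scores` and `ThresholdedScores` are hypothetical names used only for illustration, and the `">="` string stored in threshold_op is an assumption; this is not the library's implementation.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ThresholdedScores:
    """Hypothetical stand-in for MetricValueWithThreshold."""
    metric_values: List[float]
    threshold: float
    threshold_op: str


@dataclass
class Scores:
    """Hypothetical stand-in for MetricValue."""
    metric_values: List[float]

    def __ge__(self, threshold: float) -> ThresholdedScores:
        # The comparison returns a new object that remembers both the
        # threshold and the operator, rather than a single bool.
        return ThresholdedScores(self.metric_values, threshold, ">=")


result = Scores([0.2, 0.7, 0.9]) >= 0.5
print(result.threshold, result.threshold_op)
```

Keeping the threshold and operator on the result means later checks can be computed per data point rather than collapsed into one value at comparison time.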

all() -> bool

Returns True if all data points pass the threshold.

any() -> bool

Returns True if any data points pass the threshold.

property pass_rate: float

Returns the proportion of data points that pass the threshold.

threshold: float | int
threshold_op: str
property threshold_results: List[bool]

Returns a list of booleans indicating whether each data point passes the threshold.
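
To make the relationship between threshold_results, pass_rate, all() and any() concrete, here is a minimal sketch using the standard `operator` module. The `OPS` mapping and both helper functions are hypothetical, and the set of operator strings is an assumption about what threshold_op may contain.

```python
import operator
from typing import Callable, Dict, List

# Assumed mapping from threshold_op strings to comparison functions.
OPS: Dict[str, Callable[[float, float], bool]] = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "==": operator.eq,
    "!=": operator.ne,
}


def compute_threshold_results(
    values: List[float], threshold: float, threshold_op: str
) -> List[bool]:
    """One boolean per data point: does the value pass the threshold?"""
    compare = OPS[threshold_op]
    return [compare(value, threshold) for value in values]


def compute_pass_rate(results: List[bool]) -> float:
    """Proportion of data points that pass (expects a non-empty list)."""
    return sum(results) / len(results)


results = compute_threshold_results([0.2, 0.7, 0.9, 0.5], 0.5, ">=")
rate = compute_pass_rate(results)

print(results)       # [False, True, True, True]
print(rate)          # 0.75
print(all(results))  # False: one data point fails
print(any(results))  # True: at least one data point passes
```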

to_df() -> DataFrame

Returns a DataFrame of metric values for each data point.