Jupyter Notebooks
Interactive notebooks demonstrating each Fair Forge metric with visualizations.Setup
Available Notebooks
Toxicity
Location:examples/toxicity/jupyter/toxicity.ipynb
Demonstrates toxicity detection with DIDT framework:
- Cluster profiling with HDBSCAN + UMAP
- Group profiling with demographic detection
- Frequentist vs Bayesian statistical modes
- UMAP visualization of response clusters
Bias
Location:examples/bias/jupyter/bias.ipynb
Demonstrates bias detection across protected attributes:
- LlamaGuard guardian configuration
- Clopper-Pearson confidence intervals
- Per-attribute bias analysis
- Error bar visualization
Context
Location:examples/context/jupyter/context.ipynb
Demonstrates context alignment evaluation:
- LLM judge configuration
- Per-interaction scoring
- Insight analysis
- Average score calculation
Conversational
Location:examples/conversational/jupyter/conversational.ipynb
Demonstrates dialogue quality evaluation:
- Grice’s Maxims scoring
- Memory and language assessment
- Radar chart visualization
- Per-maxim distribution analysis
Humanity
Location:examples/humanity/jupyter/humanity.ipynb
Demonstrates emotional analysis:
- NRC Emotion Lexicon analysis
- Emotional entropy calculation
- Spearman correlation with ground truth
- Emotion distribution visualization
BestOf
Location:examples/bestof/jupyter/bestof.ipynb
Demonstrates tournament comparison:
- Multi-assistant dataset setup
- Elimination rounds
- Winner determination
- Tournament bracket visualization
Generators
Location:examples/generators/jupyter/generators_groq.ipynb
Demonstrates synthetic dataset generation:
- Markdown loading and chunking
- Sequential and random sampling strategies
- Conversation mode generation
- Seed examples for guided generation
Runners
Location:examples/runners/jupyter/runners.ipynb
Demonstrates test execution:
- AlquimiaRunner configuration
- Single batch and full dataset execution
- Local storage integration
- Complete pipeline example