Extended
The Extended Axis evaluates AI emotional intelligence across multiple levels of human-AI interaction, encompassing individual exchanges, social contexts, and system-wide effects.
What We Evaluate
This axis evaluates emotional intelligence at multiple levels of human-AI interaction, including:
- Interactional Flexibility: Immediate adaptive emotional responses.
- Contextual Awareness: Broader social and cultural adaptability.
- Ethical and Environmental Responsibility: Ensuring fairness, transparency, inclusivity, user autonomy, and sustainability.
- Systemic Impact: Evaluating broader societal and ecological implications.
Example Tasks
- Measure how much an operator improves model responses in a dyad.
- Score empathic quality before and after UI edits.
- Track interaction cost for human fixes (clicks, edits, time).
Metrics Used
Human-AI Dyad empathy gain (L3), interaction-cost (L2), scenario goal-completion (L4).