AI EQ Research Hub - SERA-X Axes

Sense

This axis evaluates an AI system’s ability to detect and recognize emotional content in its input.

Can the model identify expressed or implied emotions from user dialogue, even when indirect or masked?

Metrics: macro-F1 for emotion-label classification; concordance correlation coefficient (CCC) for valence/arousal regression.
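As a minimal sketch of how these two scores are commonly computed, the Python below implements macro-F1 as the unweighted mean of per-class F1 and CCC as Lin's concordance correlation coefficient with population (1/n) moments. The function names and the sample labels and values are illustrative assumptions, not part of the SERA-X specification, which may prescribe a different label set or averaging rule.

```python
from statistics import fmean


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over every label seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    scores = []
    for label in labels:
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); a class with no hits scores 0.
        scores.append(2 * tp / denom if denom else 0.0)
    return fmean(scores)


def ccc(x, y):
    """Lin's concordance correlation coefficient (population moments).

    Undefined (division by zero) when both series are constant and equal.
    """
    mx, my = fmean(x), fmean(y)
    vx = fmean([(a - mx) ** 2 for a in x])
    vy = fmean([(b - my) ** 2 for b in y])
    cov = fmean([(a - mx) * (b - my) for a, b in zip(x, y)])
    return 2 * cov / (vx + vy + (mx - my) ** 2)


# Hypothetical emotion labels and valence scores, for illustration only.
gold_labels = ["joy", "sad", "joy", "anger"]
pred_labels = ["joy", "joy", "joy", "anger"]
gold_valence = [0.1, 0.5, 0.9]
pred_valence = [0.3, 0.7, 1.1]

print(round(macro_f1(gold_labels, pred_labels), 3))
print(round(ccc(gold_valence, pred_valence), 3))
```

Unlike Pearson correlation, CCC penalizes a constant offset between predictions and gold values, which is why the shifted valence predictions above score below 1 even though they are perfectly correlated with the gold series.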

Explain

This axis tests whether the system can infer the causes or appraisals that explain an emotion.

Can the system identify what situational factors or beliefs caused an emotion?

Metrics: appraisal-vector accuracy, cause-label accuracy, and partial credit for next-emotion inference.

Respond

This axis evaluates whether the AI can generate helpful, emotionally appropriate responses.

Do responses show empathy, helpfulness, and alignment with the user’s affective state?

Metric: Human Empathy Score (HES), rated on a 1–5 scale.

Adapt

This axis measures how well the model adjusts to new cultural, linguistic, or situational emotion norms.

Does the system retain old skills while adapting to new affective expressions and norms?

Metrics: forward transfer gain and catastrophic forgetting, measured on “Experience Pack” tasks.
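The source does not define these two quantities, so the sketch below borrows definitions that are common in the continual-learning literature and treats each “Experience Pack” as one task in a sequence. It assumes a score matrix R in which R[i][j] is the score on task j after training through task i, plus a per-task baseline score for a model that has seen none of the tasks. All names and numbers are hypothetical.

```python
def forward_transfer(R, baseline):
    """Mean gain on each task just before it is trained, relative to baseline.

    For task j >= 1 the gain is R[j-1][j] - baseline[j]. Needs at least 2 tasks.
    """
    T = len(R)
    gains = [R[j - 1][j] - baseline[j] for j in range(1, T)]
    return sum(gains) / len(gains)


def average_forgetting(R):
    """Mean drop from each earlier task's best score to its final score.

    The best score for task j is taken over the stages from when task j was
    learned up to the stage before the last one. Needs at least 2 tasks.
    """
    T = len(R)
    drops = []
    for j in range(T - 1):
        best = max(R[i][j] for i in range(j, T - 1))
        drops.append(best - R[T - 1][j])
    return sum(drops) / len(drops)


# Hypothetical scores for three packs learned in sequence (rows: training stage).
R = [
    [0.80, 0.40, 0.30],
    [0.70, 0.85, 0.45],
    [0.65, 0.80, 0.90],
]
baseline = [0.25, 0.25, 0.25]  # assumed scores of a model trained on no pack

print(round(forward_transfer(R, baseline), 3))
print(round(average_forgetting(R), 3))
```

A positive forward transfer means earlier packs helped on packs not yet trained; a positive forgetting value means scores on earlier packs dropped after later ones were learned.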

Extended

The Extended Axis evaluates AI emotional intelligence across interactions, social contexts, and system-level effects.

This axis evaluates emotional intelligence at multiple levels of human-AI interaction, including:

Interactional Flexibility: Immediate adaptive emotional responses.
Contextual Awareness: Broader social and cultural adaptability.
Ethical and Environmental Responsibility: Ensuring fairness, transparency, inclusivity, user autonomy, and sustainability.
Systemic Impact: Evaluating broader societal and ecological implications.

Metrics: human-AI dyad empathy gain, interaction cost, and scenario goal completion.