Adapt
This axis measures how well the model adjusts to new cultural, linguistic, or situational emotion norms.
What We Evaluate
Does the system retain old skills while adapting to new affective expressions and norms?
Example Tasks
- Classify emotions in a dataset using culturally specific idioms (e.g., 'saudade', 'amae').
- Respond empathetically to unfamiliar dialogue styles or memes.
- Compare performance before and after fine-tuning on a new domain.
Metrics Used
Forward transfer gain and catastrophic forgetting (backward transfer drop) in 'Experience Pack' tasks.