SemEval-2007 Home
 News
 Schedule
 senseval.org


 Call for Tasks
 Call for Participation


 Task Descriptions
 Download Data


 Paper Submissions
 Program Committee
 Program
 Registration


 Organizers
 Administration

 

Task 14: Affective Text

Organizers
Carlo Strapparava and Rada Mihalcea

Web page: http://www.cs.unt.edu/~rada/affectivetext

Motivation
This task is intended as an exploration of the connection between lexical semantics and emotions. All words can potentially convey affective meaning. Every word, even those that are apparently neutral, can evoke pleasant or painful experiences due to their semantic relation with emotional concepts or categories. While some words have emotional meaning with respect to an individual story, for many others the affective power is part of the collective imagination (e.g. words such as "mum", "ghost", "war").

This latter group of words are particularly interesting, because their affective meaning is part of common sense knowledge and can be detected in the linguistic usage. For this reason, we believe it is important to study the use of words in textual productions, and possibly their co-occurrence with words in which the affective meaning is explicit. Several previous studies in linguistics and psychology have considered research issues related to the affective lexicon. For example Ortony et al. [Ortony et al., 1987] distinguishes between words directly referring to emotional states (e.g. "fear", "cheerful") and those having only an indirect reference that depends on the context (e.g. words that indicate possible emotional causes such as "killer" or emotional responses such as "cry").

The automatic detection of emotion in texts is becoming increasingly important from an applicative point of view. Consider for example the tasks of opinion mining and market analysis, affective computing, natural language processing for user-interfaces (e.g. e-learning environments, such as educational/edutainment games). Possible beneficial effects of emotions on memory, attention, and in general on fostering creativity are also well-known in psychology. Finally, news web sites are already very popular and automatic classification of news along emotive dimensions could be useful and interesting.

Task Description
To explore the connection between emotions and lexical semantics we propose to focus on the emotion classification of news headlines extracted from news web sites. The news headlines typically consist of a few words and are often written by creative people with the intention to "provoke" emotions, and consequently to attract the readers' attention.

These characteristics make the news headlines particularly suitable for use in an automatic emotion recognition setting, as the affective/emotional features (if present) are guaranteed to appear in these short sentences.

The structure of the task is as follows:

  • Corpus: News headlines, extracted from news web sites (such as Google news, CNN) and/or newspapers. In the case of web sites, we can easily collect a few thousand titles in a short amount of time.
  • Objective: Provided a set of predefined emotion labels (e.g. joy, fear, surprise), classify the titles with the appropriate emotion label and with a valence indication (positive/negative).

      The emotion annotation and the valence labeling will be regarded as two separate subtasks, and therefore a team can choose to participate in only one or both annotation tasks.

The task will be carried out in an unsupervised setting, and consequently no training will be provided. The reason behind this decision is that we want to emphasize the study of emotion lexical semantics, and avoid biasing the participants toward simple "text categorization" approaches. Nonetheless supervised systems will be not precluded from the participation, and in such cases the participating teams will be allowed to create their own supervised training sets.

The task organizers will provide in advance the set of emotion labels and a development corpus. The timeline of the task will follow the general Semeval-2007 timeframe as follows:

  • [three weeks before the deadline]: a development corpus of approximately 250 headlines annotated for emotions will be provided
  • [one week before the deadline]: the data set of approx. 2,000 news headlines will be provided
  • [deadline]: the participants submit the headlines annotated with emotion labels and/or a valence indication (positive/negative); as done in previous Senseval evaluation, the participating teams are free to choose their own submission deadline, as long as it falls within the window of time imposed by Semeval-2007.
References
[Ortony et al., 1987] A. Ortony, G. L. Clore, and M. A. Foss. The psychological foundations of the affective lexicon. Journal of Personality and Social Psychology, 53:751- 766., 1987.
 

Please e-mail questions to .
Website hosted by the Department of Computer Science at Swarthmore College.