NAME

Emotion - discrete emotion annotation processing

SYNOPSIS

empathize <transcript>
empathize --index <transcript> [<transcript> ...]

DESCRIPTION

The most difficult part of this module is learning how to annotate free-form text with information about what competition is taking place. If you haven't studied http://ghost-wheel.net then go do that now.

The first part of this document describes the XML representation of 3rd person annotations and the validation rules. The second part describes the perl API.

XML TUTORIAL

The Basics

The minimum situation consists of the type of competition and two opponents.

abbreviation  type of competition (situation)
------------  -------------------------------
destroys      [-] destroys [-]
steals        [-] steals from [+]
uneasy        [-] is made uneasy by [0]
exposes       [+] exposes [-]
impasse       [+] and [+] are at an impasse
admires       [+] admires [0]
observes      [0] observes [-]
accepts       [0] accepts [+]
ready         [0] and [0] are at readiness

Each opponent is labelled either left or right, indicating on which side of the situation he or she participates. For example:

<observes left="student" right="teacher">
  A student watches the teacher make a presentation.
</observes>

<observes right="student" left="teacher">
  The teacher watches a student make a presentation.
</observes>

Of course a teacher is probably more accustomed to lecturing in front of a class than a student. This information can be captured with the intensity attribute:

<observes left="student" right="teacher" intensity="gentle">
  A student watches the teacher make a presentation.
</observes>

<observes right="student" left="teacher" intensity="forceful">
  The teacher watches a student make a presentation.
</observes>

Now it is clear that the teacher's presentation involves some gentle excitement, but a student presentation is an exciting test of self-control. Since no spin is involved, the emotion is roughly the same regardless of whether we empathize with the presenter or the audience. To contrast, situations involving spin require some additional structure. For example:

<steals left="thief" right="child">
  The thief steals the teddy bear from the child.
</steals>

Here it is not clear which emotion is intended. Opponents experience the situation very differently. Perhaps the thief is drunk with accomplishment, but the child is probably quite angry. To avoid this ambiguity, situations involving spin follow a transactional structure:

initiator:   (1) before ---+---> (3) after
                           |
                           v
victim:             (2) tension

Here is a hypothetical trace of a steals situation decomposed into the three transaction phases:

<steals left="thief" right="child" before="focused">
  The thief eyes the teddy bear maliciously and wrenches it
  from the child's passionate grip.
</steals>

<steals left="thief" right="child" tension="focused">
  The child becomes furious and starts crying.
</steals>

<steals left="thief" right="child" after="focused">
  The thief triumphantly stows the teddy bear in his backpack.
</steals>

Perhaps this sequence of events is ghastly, but here the goal is merely to model emotions accurately. In this example, the three phases unambiguously correspond to "anxiety," "anger," and "drunk with accomplishment."

The attributes before, tension, and after describe the emotional tension. Tension modifies a general emotion to fit the precise situation. For example:

<steals left="thief" right="child" before="relaxed">
  The thief grabs the teddy bear with abandon.
</steals>

<steals left="thief" right="child" before="stifled">
  The thief studies whether the teddy bear can be stolen.
</steals>

A tension modifier is either focused, relaxed, or stifled. Emotion correspondance charts can assist in fitting the most accurate modifier to the situation. Of course a thief is not always successful:

<steals left="thief" right="child" before="stifled">
  The thief studies whether the teddy bear can be stolen.
</steals>

<impasse left="child" right="thief" tension="focused">
  The child grips the teddy bear even more tightly.
</impasse>

The four situations involving spin (steals, exposes, admires, and accepts) mostly follow the three phase transaction structure. However, flexibility offered by accepts shows the need to explicitly indicate an initiator. For example:

<accepts left="chef" right="child" before="focused">
  I am hungry.  Is dinner ready?
</accepts>

<accepts left="chef" right="child" before="focused">
  You are hungry!
</accepts>

Looking just at the annotation, a reader cannot determine whether the child is making a demand, or the chef is voicing the child's expression of hunger. An initiator attribute eliminates the ambiguity:

<accepts left="chef" right="child" before="focused" initiator="right">
  I am hungry.  Is dinner ready?
</accepts>

<accepts left="chef" right="child" before="focused" initiator="left">
  You are hungry!
</accepts>

Now the situations can be distinguish. While the annotation is precise, it is also quite a lot of typing. To reduce the burden on the analyst, the parser is capable of guessing reasonable defaults by consideration of who is talking. For example:

<accepts left="chef" initiator="child" before="focused">
  I am hungry.  Is dinner ready?
</accepts>

Or alternately:

<talk who="child">
  <accepts left="chef" before="focused">
    I am hungry.  Is dinner ready?
  </accepts>
</talk>

We covered most of the simple variations. Here is a summary:

situation  variations
---------- ---------------------------------
steals     before|tension|after, initiator
exposes    before|tension|after, initiator
admires    before|tension|after, initiator
accepts    before|tension|after, initiator
impasse    tension
observes   intensity, [initiator]
uneasy     intensity, [initiator]
ready      intensity
destroys   -

The square brackets denote optional attributes. More will be said about this later.

Reaction

Question & answer (Q&A) format is a common way to structure discussion. This section looks at the Q&A structure in comparison with the competition model. It will be shown that there are some conflicts in the corresponce, however, the complementary features can be embraced.

Questions usually end with a question mark. Even so, from the point of view of competition, questions are essentially demands. For example:

<accepts left="child" initiator="parent" before="focused">
  Does your teddy bear have brown fur?
</accepts>

In answer, the child's choices include:

<accepts initiator="child" right="parent" before="focused">
  Yes!
</accepts>

Or:

<impasse initiator="child" right="parent" tension="relaxed">
  Which teddy bear?
</accepts>

Notice that similar replies could be expected from the non-question demand, "I want the teddy bear." For example, accepts ("OK, here it is.") or impasse ("Which teddy bear?"). While the question mark accents the demand somehow, its presence or absence does not seem particularly significant in the context of competition.

The answer side of Q&A will now be considered. Answers are generally offered in reaction to questions. However, it is easy to demonstrate ambiguity. For example:

<accepts left="child" initiator="parent" before="focused">
  Does your teddy bear have brown fur?
</accepts>

<impasse initiator="child" right="parent" tension="relaxed">
  Which teddy bear?
</accepts>

Is the child's reply a question or an answer? Perhaps both, but clearly the Q&A terminology is imprecise. Despite these problems, some kind of contextual relationship definitely exists. To encode the Q&A relationship precisely, a clear standard is needed to recognize it.

The standard suggested here is that the participants must match but the initiator must flip-flop. These rules were discovered by surveying many Q&A style exchanges and the rules seem to work well in practice. re (meaning "in response to") and id attributes serve the purpose. For example:

<accepts id="q1" right="child" initiator="adult" before="focused">
  Does your teddy bear have brown fur?
</accepts>

<accepts re="q1" left="parent" initiator="child" before="focused">
  Yes!
</accepts>

Quality

So far, the chosen markup may seem somewhat arbitrary. For example, why indicate merely accepts?

<accepts id="q1" right="child" initiator="adult" before="focused">
  Does your teddy bear have brown fur?
</accepts>

Why not more detail, such as:

<accepts id="q1" right="child" initiator="adult" before="focused">
  Does your
    <admires initiator="child" right="teddy bear" tension="relaxed">
      teddy bear
    </admires>
  have brown fur?
</accepts>

In fact, either annotation could be accurate. The problem is that written text is not rich enough as a medium to thoroughly convey a situation's emotional textual unless we resort to cumbersome contortions (such as XML :-). On the other hand, film (or the actual life experience itself) is quite sufficiently expressive. Still, there may be a question as to what constitutes *quality* markup. For example:

<steals initiator="Yupi" right="Lon" before="focused">
  Yupi gives Lon a beautiful and fragrant flower.
</steals>

Most likely this markup is wrong. A proposed criteria for judging the quality of markup follows:

  • Is it detailed enough to isolate single emotions?

    Looking at a scene as a sequences of moments, every moment can be consider an *independent* competition event. The question is: into how many intervals must we divide the timeline such that each interval is exactly one emotion?

  • Does it span the entire duration of the emotion?

    This duration criteria is consided in the next section, "Resolution."

Resolution

The parser tries to keep track of situations which remain unresolved (or pending). For example:

<accepts id="o1" initiator="clerk" left="Joey" before="focused">
  What toppings do you want on your pizza?
</accepts>

<impasse id="o2" re="o1" left="Joey" right="cleck" tension="stifled">
  Joey puts the phone on hold.
</impasse>

In general, stories don't stop abruptly leaving unresolved cliffhangers. Here is a clue that can be used to debug annotations. After processing a transcript, empathize outputs all the situations which do not find closure. For example, until Joey picks up the phone or the clerk gives up in disgust, the pizza situation will be flagged as unresolved.

Another way to manage resolution is to declare the victim absent. Here is an example:

<talk who="teacher">
  <accepts left="students" before="focused" absent="left">
    The students listen to the teacher's tedious monologue.
  </accepts>
</talk>

In fact, the teacher may never know whether the situation was truly accepts, observes, or even merely:

<ready intensity="gentle" left="students" right="teacher">
  The students are intermittently aware that the
  teacher continues to talk.
</ready>

Notice that the absent attribute is unnecessary in this case. A variety of situations do not trigger any need for resolution, and ready is one of them.

Wildcards

<accepts left="*" initiator="Crawford" before="focused">
  Does anyone know the time?
</accepts>

(Talking about anthropomorphic rivals.)

Context

Here is a prior example updated with new markup:

<steals id="p1" initiator="thief" right="child" before="focused">
  The thief eyes the teddy bear maliciously and wrenches it
  from the child's passionate grip.
</steals>

<steals re="p1" left="thief" initiator="child" tension="focused">
  The child becomes furious and starts crying.
</steals>

<steals context="p1" initiator="thief" right="child" after="focused">
  The thief triumphantly stows the teddy bear in his backpack.
</steals>

This last situation describing the thief gloating over his accomplishment is not really an answer to "p1", but there is certainly some kind of direct contextual connection. The context attribute is available for such cases. context is a catch-all. It is appropriate for any kind of generic relationship for which a more specific classification is not available.

EMOTION LIBRARY API

  • Emotion::set_transcript($xml_file)

  • my $dialog_id = Emotion::set_speaker($who)

  • my @pending = Emotion::unresolved()

Emotion::Atom Methods

  • my $situation = Emotion::Atom->new($expat, $attr)

  • my $emotion = $atom->emotion;

SEE ALSO

http://ghost-wheel.net