<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="http://coco-lab.org/Elgg/news/rss/rssstyles.xsl"/?>
<rss version='2.0'   xmlns:dc='http://purl.org/dc/elements/1.1/'>
<channel xml:base='http://coco-lab.org/Elgg//weblog/'>
    <title><![CDATA[das : RSS Feed]]></title>
    <description><![CDATA[RSS Feed showing user for das using the Elgg software]]></description>
    <generator>Elgg</generator>
    <link>http://coco-lab.org/Elgg/activity/user/das/summary/all/all/1</link>        
        <item>
            <title><![CDATA[minutes080508]]></title>
            <link>http://coco-lab.org/Elgg/das/page/minutes080508</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/minutes080508</guid>
            <pubDate>05/08/08 10:02</pubDate>
            <description><![CDATA[<div>&nbsp;&nbsp;- present: M, T, D</div><div>&nbsp;&nbsp;- priority: close processing chain. Finally get something from ASR</div><div>&nbsp;&nbsp; &nbsp;to parser to DM to TTS -- even if it is only a parrot system!</div><div>&nbsp;&nbsp;- Dialogue Manager:</div><div>&nbsp;&nbsp; &nbsp;- can be something like Dipper, i.e. information-state update</div><div>&nbsp;&nbsp; &nbsp; &nbsp;based..&nbsp;</div><div>&nbsp;&nbsp; &nbsp;- or FSA (specified in SCXML or similar?)</div><div>&nbsp;&nbsp; &nbsp;- but rules can be simple anyway, simple FSA-stuff:</div><div>&nbsp;&nbsp; &nbsp; &nbsp;- identification -&gt; confirmation (repeat on negative) -&gt;</div><div>&nbsp;&nbsp; &nbsp; &nbsp; &nbsp;orientation -&gt; confirmation (repeat on negative) -&gt; placement</div><div>&nbsp;&nbsp; &nbsp; &nbsp; &nbsp;(repeat on negative)</div><div>&nbsp;&nbsp; &nbsp;- S: &quot;Welche Teil?&quot; U: &quot;Das zweite von links&quot; S: &quot;Das hier?&quot;&nbsp;</div><div>&nbsp;&nbsp; &nbsp; &nbsp;U: &quot;Nein, daneben&quot;. [ --&gt; need to be able to deal with</div><div>&nbsp;&nbsp; &nbsp; &nbsp;context-dependent utterances ]</div><div>&nbsp;&nbsp;- do WOz pretty soon? Wizard hears user utterances, can trigger</div><div>&nbsp;&nbsp; &nbsp;simple prompts:</div><div>&nbsp;&nbsp; &nbsp;- &quot;Welches Teil?&quot; &quot;Soll ich es drehen&quot; &quot;Wohin?&quot;;</div><div>&nbsp;&nbsp; &nbsp; &nbsp;&quot;So?&quot; &quot;Hier?&quot;</div><div>&nbsp;&nbsp; &nbsp;- to hide that Wizard is human, let GUI do mouse movements? I.e.,</div><div>&nbsp;&nbsp; &nbsp; &nbsp;wizard selects parameters of action (selecting piece, rotating</div><div>&nbsp;&nbsp; &nbsp; &nbsp;it, dragging it), then selects prompt (&quot;So?&quot;); this is then sent</div><div>&nbsp;&nbsp; &nbsp; &nbsp;to system which executes action (e.g., computes and executes</div><div>&nbsp;&nbsp; &nbsp; &nbsp;mouse path; plays synchronised utterance). This won&#39;t allow us</div><div>&nbsp;&nbsp; &nbsp; &nbsp;to test reaction to smooth turn-taking (since it is</div><div>&nbsp;&nbsp; &nbsp; &nbsp;non-incremental; the wizard will have to fully specify the</div><div>&nbsp;&nbsp; &nbsp; &nbsp;action), but it will allow us to test user reactions &amp; learn</div><div>&nbsp;&nbsp; &nbsp; &nbsp;about the complexity of their speech. Especially the reactions to</div><div>&nbsp;&nbsp; &nbsp; &nbsp;CRs like &quot;so?&quot;. E.g., &quot;nein, eins weiter hoch&quot;.</div><div><div><br /></div><div><br /></div></div><div>THE FRIGGING WIKI IS BROKEN. you can find the complete minutes on my weblog.</div>]]></description>
        </item>        
        <item>
            <title><![CDATA[minutes080508]]></title>
            <link>http://coco-lab.org/Elgg/das/page/minutes080508</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/minutes080508</guid>
            <pubDate>05/08/08 10:00</pubDate>
            <description><![CDATA[<div>&nbsp;&nbsp;- present: M, T, D</div><div>&nbsp;&nbsp;- priority: close processing chain. Finally get something from ASR</div><div>&nbsp;&nbsp; &nbsp;to parser to DM to TTS -- even if it is only a parrot system!</div><div>&nbsp;&nbsp;- Dialogue Manager:</div><div>&nbsp;&nbsp; &nbsp;- can be something like Dipper, i.e. information-state update</div><div>&nbsp;&nbsp; &nbsp; &nbsp;based..&nbsp;</div><div>&nbsp;&nbsp; &nbsp;- or FSA (specified in SCXML or similar?)</div><div>&nbsp;&nbsp; &nbsp;- but rules can be simple anyway, simple FSA-stuff:</div><div>&nbsp;&nbsp; &nbsp; &nbsp;- identification -&gt; confirmation (repeat on negative) -&gt;</div><div>&nbsp;&nbsp; &nbsp; &nbsp; &nbsp;orientation -&gt; confirmation (repeat on negative) -&gt; placement</div><div>&nbsp;&nbsp; &nbsp; &nbsp; &nbsp;(repeat on negative)</div><div>&nbsp;&nbsp; &nbsp;- S: &quot;Welche Teil?&quot; U: &quot;Das zweite von links&quot; S: &quot;Das hier?&quot;&nbsp;</div><div>&nbsp;&nbsp; &nbsp; &nbsp;U: &quot;Nein, daneben&quot;. [ --&gt; need to be able to deal with</div><div>&nbsp;&nbsp; &nbsp; &nbsp;context-dependent utterances ]</div><div>&nbsp;&nbsp;- do WOz pretty soon? Wizard hears user utterances, can trigger</div><div>&nbsp;&nbsp; &nbsp;simple prompts:</div><div>&nbsp;&nbsp; &nbsp;- &quot;Welches Teil?&quot; &quot;Soll ich es drehen&quot; &quot;Wohin?&quot;;</div><div>&nbsp;&nbsp; &nbsp; &nbsp;&quot;So?&quot; &quot;Hier?&quot;</div><div>&nbsp;&nbsp; &nbsp;- to hide that Wizard is human, let GUI do mouse movements? I.e.,</div><div>&nbsp;&nbsp; &nbsp; &nbsp;wizard selects parameters of action (selecting piece, rotating</div><div>&nbsp;&nbsp; &nbsp; &nbsp;it, dragging it), then selects prompt (&quot;So?&quot;); this is then sent</div><div>&nbsp;&nbsp; &nbsp; &nbsp;to system which executes action (e.g., computes and executes</div><div>&nbsp;&nbsp; &nbsp; &nbsp;mouse path; plays synchronised utterance). This won&#39;t allow us</div><div>&nbsp;&nbsp; &nbsp; &nbsp;to test reaction to smooth turn-taking (since it is</div><div>&nbsp;&nbsp; &nbsp; &nbsp;non-incremental; the wizard will have to fully specify the</div><div>&nbsp;&nbsp; &nbsp; &nbsp;action), but it will allow us to test user reactions &amp; learn</div><div>&nbsp;&nbsp; &nbsp; &nbsp;about the complexity of their speech. Especially the reactions to</div><div>&nbsp;&nbsp; &nbsp; &nbsp;CRs like &quot;so?&quot;. E.g., &quot;nein, eins weiter hoch&quot;.</div><div><br /></div><div><br /></div>]]></description>
        </item>        
        <item>
            <title><![CDATA[Home Page]]></title>
            <link>http://coco-lab.org/Elgg/das/page/Home+Page</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/Home+Page</guid>
            <pubDate>05/08/08 09:59</pubDate>
            <description><![CDATA[<p>Besprechungsprotokolle / meeting minutes</p><p>(newest first)</p><p>05/05/08 <a href="http://coco-lab.org/Elgg/das/page/minutes080508">minutes080508</a>&nbsp;</p><p>14/04/08 <a href="http://coco-lab.org/Elgg/das/page/minutes140408">minutes140408</a></p><p>03/02/08 <a href="http://coco-lab.org/Elgg/das/page/minutes030208b">minutes030208b</a></p><p>04/12/07 <a href="http://coco-lab.org/Elgg/das/page/minutes041207">minutes041207</a></p><p>26/11/07 <a href="http://coco-lab.org/Elgg/timo/page/hours+20071126">@Timo</a></p><p>19/11/07 <a href="http://coco-lab.org/Elgg/das/page/minutes191107">minutes191107</a></p><p>13/11/07 <a href="http://coco-lab.org/Elgg/das/page/minutes131107">minutes131107</a></p><p>05/11/07 <a href="http://coco-lab.org/Elgg/das/page/minutes051107">minutes051107</a></p><p>22/10/07 <a href="http://coco-lab.org/Elgg/das/page/minutes221007">minutes221007</a></p><p>01/10/07 <a href="http://coco-lab.org/Elgg/das/page/minutes2007_10_01">minutes2007_10_01</a></p><p>10/09/07 <a href="http://coco-lab.org/Elgg/das/page/minutes100907">minutes100907</a></p><p>23/08/07 <a href="http://coco-lab.org/Elgg/das/page/minutes230807">minutes230807</a></p><p>03/07/07 <a href="http://coco-lab.org/Elgg/das/page/minutes030707">minutes030707</a></p><p>19/06/07 <a href="http://coco-lab.org/Elgg/das/page/minutes190607">minutes190607</a></p><p>05/06/07 <a href="http://coco-lab.org/Elgg/das/page/minutes050607_zeitwort2">minutes050607_zeitwort2</a></p><p>21/05/07 <a href="http://coco-lab.org/Elgg/das/page/minutes210507">minutes210507</a></p><p>&nbsp;</p><p>Sonstiges</p><p>&nbsp;</p><p><a href="http://coco-lab.org/Elgg/das/page/Conferences2008">Conferences2008</a></p>]]></description>
        </item>        
        <item>
            <title><![CDATA[minutes140408]]></title>
            <link>http://coco-lab.org/Elgg/das/page/minutes140408</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/minutes140408</guid>
            <pubDate>04/14/08 23:40</pubDate>
            <description><![CDATA[<div>- InPro, meeting, minutes, 14/04/08</div><div>&nbsp; - present: M, T, G, D</div><div>&nbsp; - Gabriel demo&#39;ed current state of Higgins. Displays duration of</div><div>&nbsp;   vocal action (both recognised and own) on timeline, uses simple</div><div>&nbsp;   boundary tone classification (up, down) to base decisions on</div><div>&nbsp;   thresholding on. (This is mostly a test of the architecture at the</div><div>&nbsp;   moment, the strategies are very simple.)</div><div>&nbsp; - dysfluencies: what to do with aborted words? Most likely, sphinx</div><div>&nbsp;   will recognise rubbish. Would be too unrestrictive to include</div><div>&nbsp;   aborted versions of all words; adding other methods (e.g., using</div><div>&nbsp;   prosodic info) would require too much changes at low level of</div><div>&nbsp;   ASR. (Hm. But at some point we&#39;ll have frame-level /</div><div>&nbsp;   syllable-level prosodic info anyway. Shouldn&#39;t be too hard to let</div><div>&nbsp;   classifier judge whether word was perhaps misrecognised because it</div><div>&nbsp;   was a different, aborted one.)</div><div>&nbsp; - Timo and Gabriel will work together on getting better classifier</div><div>&nbsp;   for boundary tone detection to work. Does it need to do speaker</div><div>&nbsp;   adaptation?</div><div>&nbsp; - first step on syntax side: toy grammar for Pento domain.</div><div>&nbsp;   (``Nimm das {Kreuz | Teil | lange Ding} aus der Mitte links</div><div>&nbsp;   oben&#39;&#39;) in Higgins parser.</div><div>&nbsp; - using a grammar as linguistic model in sphinx apparently doesn&#39;t</div><div>&nbsp;   work incrementally (doesn&#39;t return results before top category has</div><div>&nbsp;   been found), but using statistical LM does work. (Although there</div><div>&nbsp;   are still technical problems, but it looks promising.)</div><div>&nbsp; - even if we can&#39;t use a grammar, we can still bootstrap an n-gram</div><div>&nbsp;   LM with utterances generated from a domain grammar.</div><div><br /></div>]]></description>
        </item>        
        <item>
            <title><![CDATA[Home Page]]></title>
            <link>http://coco-lab.org/Elgg/das/page/Home+Page</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/Home+Page</guid>
            <pubDate>04/14/08 23:39</pubDate>
            <description><![CDATA[<p>Besprechungsprotokolle / meeting minutes</p><p>(newest first)</p><p>14/04/08 <a href="http://coco-lab.org/Elgg/das/page/minutes140408">minutes140408</a></p><p>03/02/08 <a href="http://coco-lab.org/Elgg/das/page/minutes030208b">minutes030208b</a></p><p>04/12/07 <a href="http://coco-lab.org/Elgg/das/page/minutes041207">minutes041207</a></p><p>26/11/07 <a href="http://coco-lab.org/Elgg/timo/page/hours+20071126">@Timo</a></p><p>19/11/07 <a href="http://coco-lab.org/Elgg/das/page/minutes191107">minutes191107</a></p><p>13/11/07 <a href="http://coco-lab.org/Elgg/das/page/minutes131107">minutes131107</a></p><p>05/11/07 <a href="http://coco-lab.org/Elgg/das/page/minutes051107">minutes051107</a></p><p>22/10/07 <a href="http://coco-lab.org/Elgg/das/page/minutes221007">minutes221007</a></p><p>01/10/07 <a href="http://coco-lab.org/Elgg/das/page/minutes2007_10_01">minutes2007_10_01</a></p><p>10/09/07 <a href="http://coco-lab.org/Elgg/das/page/minutes100907">minutes100907</a></p><p>23/08/07 <a href="http://coco-lab.org/Elgg/das/page/minutes230807">minutes230807</a></p><p>03/07/07 <a href="http://coco-lab.org/Elgg/das/page/minutes030707">minutes030707</a></p><p>19/06/07 <a href="http://coco-lab.org/Elgg/das/page/minutes190607">minutes190607</a></p><p>05/06/07 <a href="http://coco-lab.org/Elgg/das/page/minutes050607_zeitwort2">minutes050607_zeitwort2</a></p><p>21/05/07 <a href="http://coco-lab.org/Elgg/das/page/minutes210507">minutes210507</a></p><p>&nbsp;</p><p>Sonstiges</p><p>&nbsp;</p><p><a href="http://coco-lab.org/Elgg/das/page/Conferences2008">Conferences2008</a></p>]]></description>
        </item>        
        <item>
            <title><![CDATA[030208cont]]></title>
            <link>http://coco-lab.org/Elgg/das/page/030208cont</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/030208cont</guid>
            <pubDate>03/03/08 22:37</pubDate>
            <description><![CDATA[<div>&nbsp; - kurzfristige Projekte:</div><div>&nbsp;   - bababa2, SIGdial Poster</div><div>&nbsp;     - TO DOs, unprioritisiert: a) Silbengrenzen, von</div><div>&nbsp;       Aussprachew&ouml;rterbuch kommend; b) echtes Audio verwenden,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>Kielkorpus; c) ASR verwenden, W&ouml;rter, ngramme; d) bessere</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>speech states, phrasengrenzen (f. BCs); e) besser</div><div>&nbsp;       TT-Strategien; f) simulation, constant time &lt; (or &gt;)</div><div>&nbsp;       real-time; g) bessere Evaluation; h) interruption management;</div><div>&nbsp;       i) BC management; j) Parametrisierung (chattiness,</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>interruption propability, etc.); k) adaptivity</div><div>&nbsp;     - m&ouml;gliche Ans&auml;tze f. Paper:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- in Richtung David T., `believable, non-scripted content-free</div><div>&nbsp;         background chatter&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; Nicht sehr &uuml;berzeugend; um online erzeugt zu werden, doch</div><div>&nbsp;         ein wenig resourcenhungrig. Nur f&uuml;r Hintergrundgerede w&uuml;rde</div><div>&nbsp;         das wohl niemand ernsthaft einsetzen.</div><div>&nbsp;       - `simple rules create realistic turn-taking patterns&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; SSJ rules as *generative* rules, not just descriptive. Shows</div><div>&nbsp;         that such a set of rules, together w/ some audio magic, are</div><div>&nbsp;         enough to produce patterns that are `natural&#39; (in a way that</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; needs to be defined properly). Again sort of upper-bound; to</div><div>&nbsp;         get something like this working properly within a real</div><div>&nbsp;         system, here&#39;s what we would need in terms of components.</div><div>&nbsp;         - to do first: b), d), e), g).</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; - needed: more principled metric for `naturalness&#39; of</div><div>&nbsp;           resulting corpus. Multi-dimensional: distribution of gaps</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   &amp; overlaps, balance btw speakers, turn length (in time,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   but also # of utterances).</div><div>&nbsp;   - `syntactic and prosodic language modelling for incremental</div><div>&nbsp;     utterance segmentation&#39;, f&uuml;r Coling</div><div>&nbsp;     utterance end pointing, but in an incremental set up. Needed to</div><div>&nbsp;     know where to clear the chart of the parser. Connected to a</div><div>&nbsp;     well-researched task (i.e., easy to motivate &amp; compare), but</div><div>&nbsp;     different in that we don&#39;t allow (as much?) right context.</div><div>&nbsp;     - method:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- select only multi-utterance turns; EOUs to find are the</div><div>&nbsp;         turn-internal ones.</div><div>&nbsp;       - use original data &amp; variants w/ various WER.</div><div>&nbsp;&nbsp;        Those need plausible time information. How much does</div><div>&nbsp;         this degrade performance?</div><div><br /></div><div>&nbsp;     - what&#39;s a good way to evaluate this? follow-on effects of wrong</div><div>&nbsp;       decisions: an insert for example makes us restart the parser,</div><div>&nbsp;       and hence get other things wrong?</div><div><br /></div>]]></description>
        </item>        
        <item>
            <title><![CDATA[030208cont]]></title>
            <link>http://coco-lab.org/Elgg/das/page/030208cont</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/030208cont</guid>
            <pubDate>03/03/08 22:36</pubDate>
            <description><![CDATA[<div>&nbsp; - kurzfristige Projekte:</div><div>&nbsp;   - bababa2, SIGdial Poster</div><div>&nbsp;     - TO DOs, unprioritisiert: a) Silbengrenzen, von</div><div>&nbsp;       Aussprachew&ouml;rterbuch kommend; b) echtes Audio verwenden,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>Kielkorpus; c) ASR verwenden, W&ouml;rter, ngramme; d) bessere</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>speech states, phrasengrenzen (f. BCs); e) besser</div><div>&nbsp;       TT-Strategien; f) simulation, constant time &lt; (or &gt;)</div><div>&nbsp;       real-time; g) bessere Evaluation; h) interruption management;</div><div>&nbsp;       i) BC management; j) Parametrisierung (chattiness,</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>interruption propability, etc.); k) adaptivity</div><div>&nbsp;     - m&ouml;gliche Ans&auml;tze f. Paper:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- in Richtung David T., `believable, non-scripted content-free</div><div>&nbsp;         background chatter&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; Nicht sehr &uuml;berzeugend; um online erzeugt zu werden, doch</div><div>&nbsp;         ein wenig resourcenhungrig. Nur f&uuml;r Hintergrundgerede w&uuml;rde</div><div>&nbsp;         das wohl niemand ernsthaft einsetzen.</div><div>&nbsp;       - `simple rules create realistic turn-taking patterns&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; SSJ rules as *generative* rules, not just descriptive. Shows</div><div>&nbsp;         that such a set of rules, together w/ some audio magic, are</div><div>&nbsp;         enough to produce patterns that are `natural&#39; (in a way that</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; needs to be defined properly). Again sort of upper-bound; to</div><div>&nbsp;         get something like this working properly within a real</div><div>&nbsp;         system, here&#39;s what we would need in terms of components.</div><div>&nbsp;         - to do first: b), d), e), g).</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; - needed: more principled metric for `naturalness&#39; of</div><div>&nbsp;           resulting corpus. Multi-dimensional: distribution of gaps</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   &amp; overlaps, balance btw speakers, turn length (in time,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   but also # of utterances).</div><div>&nbsp;   - `syntactic and prosodic language modelling for incremental</div><div>&nbsp;     utterance segmentation&#39;, f&uuml;r Coling</div><div>&nbsp;     utterance end pointing, but in an incremental set up. Needed to</div><div>&nbsp;     know where to clear the chart of the parser. Connected to a</div><div>&nbsp;     well-researched task (i.e., easy to motivate &amp; compare), but</div><div>&nbsp;     different in that we don&#39;t allow (as much?) right context.</div><div>&nbsp;     - method:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- select only multi-utterance turns; EOUs to find are the</div><div>&nbsp;         turn-internal ones.</div><div>&nbsp;       - use original data &amp; variants w/ various WER.</div><div>&nbsp;     - what&#39;s a good way to evaluate this? follow-on effects of wrong</div><div>&nbsp;       decisions: an insert for example makes us restart the parser,</div><div>&nbsp;       and hence get other things wrong?</div><div><br /></div>]]></description>
        </item>        
        <item>
            <title><![CDATA[030208cont]]></title>
            <link>http://coco-lab.org/Elgg/das/page/030208cont</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/030208cont</guid>
            <pubDate>03/03/08 22:35</pubDate>
            <description><![CDATA[<div>&nbsp; - kurzfristige Projekte:</div><div>&nbsp;   - bababa2, SIGdial Poster</div><div>&nbsp;     - TO DOs, unprioritisiert: a) Silbengrenzen, von</div><div>&nbsp;       Aussprachew&ouml;rterbuch kommend; b) echtes Audio verwenden,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>Kielkorpus; c) ASR verwenden, W&ouml;rter, ngramme; d) bessere</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>speech states, phrasengrenzen (f. BCs); e) besser</div><div>&nbsp;       TT-Strategien; f) simulation, constant time &lt; (or &gt;)</div><div>&nbsp;       real-time; g) bessere Evaluation; h) interruption management;</div><div>&nbsp;       i) BC management; j) Parametrisierung (chattiness,</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>interruption propability, etc.); k) adaptivity</div><div>&nbsp;     - m&ouml;gliche Ans&auml;tze f. Paper:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- in Richtung David T., `believable, non-scripted content-free</div><div>&nbsp;         background chatter&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; Nicht sehr &uuml;berzeugend; um online erzeugt zu werden, doch</div><div>&nbsp;         ein wenig resourcenhungrig. Nur f&uuml;r Hintergrundgerede w&uuml;rde</div><div>&nbsp;         das wohl niemand ernsthaft einsetzen.</div><div>&nbsp;       - `simple rules create realistic turn-taking patterns&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; SSJ rules as *generative* rules, not just descriptive. Shows</div><div>&nbsp;         that such a set of rules, together w/ some audio magic, are</div><div>&nbsp;         enough to produce patterns that are `natural&#39; (in a way that</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; needs to be defined properly). Again sort of upper-bound; to</div><div>&nbsp;         get something like this working properly within a real</div><div>&nbsp;         system, here&#39;s what we would need in terms of components.</div><div>&nbsp;         - to do first: b), d), e), g).</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; - needed: more principled metric for `naturalness&#39; of</div><div>&nbsp;           resulting corpus. Multi-dimensional: distribution of gaps</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   &amp; overlaps, balance btw speakers, turn length (in time,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   but also # of utterances).</div><div>&nbsp;   - `syntactic and prosodic language modelling for incremental</div><div>&nbsp;     utterance segmentation&#39;, f&uuml;r Coling</div><div>&nbsp;     utterance end pointing, but in an incremental set up. Needed to</div><div>&nbsp;     know where to clear the chart of the parser. Connected to a</div><div>&nbsp;     well-researched task (i.e., easy to motivate &amp; compare), but</div><div>&nbsp;     different in that we don&#39;t allow (as much?) right context.</div><div>&nbsp;     - method:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- select only multi-utterance turns; EOUs to find are the</div><div>&nbsp;         turn-internal ones.</div><div>&nbsp;       - use original data &amp; variants w/ various WER.</div><div>&nbsp;     - what&#39;s a good way to evaluate this? follow-on effects of wrong</div><div>&nbsp;       decisions: an insert for example makes us restart the parser,</div><div>&nbsp;       and hence get other things wrong?</div><div><br /></div>]]></description>
        </item>        
        <item>
            <title><![CDATA[030208cont]]></title>
            <link>http://coco-lab.org/Elgg/das/page/030208cont</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/030208cont</guid>
            <pubDate>03/03/08 22:35</pubDate>
            <description><![CDATA[<div>&nbsp; - kurzfristige Projekte:</div><div>&nbsp;   - bababa2, SIGdial Poster</div><div>&nbsp;     - TO DOs, unprioritisiert: a) Silbengrenzen, von</div><div>&nbsp;       Aussprachew&ouml;rterbuch kommend; b) echtes Audio verwenden,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>Kielkorpus; c) ASR verwenden, W&ouml;rter, ngramme; d) bessere</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>speech states, phrasengrenzen (f. BCs); e) besser</div><div>&nbsp;       TT-Strategien; f) simulation, constant time &lt; (or &gt;)</div><div>&nbsp;       real-time; g) bessere Evaluation; h) interruption management;</div><div>&nbsp;       i) BC management; j) Parametrisierung (chattiness,</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>interruption propability, etc.); k) adaptivity</div><div>&nbsp;     - m&ouml;gliche Ans&auml;tze f. Paper:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- in Richtung David T., `believable, non-scripted content-free</div><div>&nbsp;         background chatter&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; Nicht sehr &uuml;berzeugend; um online erzeugt zu werden, doch</div><div>&nbsp;         ein wenig resourcenhungrig. Nur f&uuml;r Hintergrundgerede w&uuml;rde</div><div>&nbsp;         das wohl niemand ernsthaft einsetzen.</div><div>&nbsp;       - `simple rules create realistic turn-taking patterns&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; SSJ rules as *generative* rules, not just descriptive. Shows</div><div>&nbsp;         that such a set of rules, together w/ some audio magic, are</div><div>&nbsp;         enough to produce patterns that are `natural&#39; (in a way that</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; needs to be defined properly). Again sort of upper-bound; to</div><div>&nbsp;         get something like this working properly within a real</div><div>&nbsp;         system, here&#39;s what we would need in terms of components.</div><div>&nbsp;         - to do first: b), d), e), g).</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; - needed: more principled metric for `naturalness&#39; of</div><div>&nbsp;           resulting corpus. Multi-dimensional: distribution of gaps</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   &amp; overlaps, balance btw speakers, turn length (in time,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   but also # of utterances).</div><div>&nbsp;   - `syntactic and prosodic language modelling for incremental</div><div>&nbsp;     utterance segmentation&#39;, f&uuml;r Coling</div><div>&nbsp;     utterance end pointing, but in an incremental set up. Needed to</div><div>&nbsp;     know where to clear the chart of the parser. Connected to a</div><div>&nbsp;     well-researched task (i.e., easy to motivate &amp; compare), but</div><div>&nbsp;     different in that we don&#39;t allow (as much?) right context.</div><div>&nbsp;     - method:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- select only multi-utterance turns; EOUs to find are the</div><div>&nbsp;         turn-internal ones.</div><div>&nbsp;       - use original data &amp; variants w/ various WER</div><div>&nbsp;     - what&#39;s a good way to evaluate this? follow-on effects of wrong</div><div>&nbsp;       decisions: an insert for example makes us restart the parser,</div><div>&nbsp;       and hence get other things wrong?</div><div><br /></div>]]></description>
        </item>        
        <item>
            <title><![CDATA[030208cont]]></title>
            <link>http://coco-lab.org/Elgg/das/page/030208cont</link>
            <guid isPermaLink="true">http://coco-lab.org/Elgg/das/page/030208cont</guid>
            <pubDate>03/03/08 22:34</pubDate>
            <description><![CDATA[<div>&nbsp; - kurzfristige Projekte:</div><div>&nbsp;   - bababa2, SIGdial Poster</div><div>&nbsp;     - TO DOs, unprioritisiert: a) Silbengrenzen, von</div><div>&nbsp;       Aussprachew&ouml;rterbuch kommend; b) echtes Audio verwenden,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>Kielkorpus; c) ASR verwenden, W&ouml;rter, ngramme; d) bessere</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>speech states, phrasengrenzen (f. BCs); e) besser</div><div>&nbsp;       TT-Strategien; f) simulation, constant time &lt; (or &gt;)</div><div>&nbsp;       real-time; g) bessere Evaluation; h) interruption management;</div><div>&nbsp;       i) BC management; j) Parametrisierung (chattiness,</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>interruption propability, etc.); k) adaptivity</div><div>&nbsp;     - m&ouml;gliche Ans&auml;tze f. Paper:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- in Richtung David T., `believable, non-scripted content-free</div><div>&nbsp;         background chatter&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; Nicht sehr &uuml;berzeugend; um online erzeugt zu werden, doch</div><div>&nbsp;         ein wenig resourcenhungrig. Nur f&uuml;r Hintergrundgerede w&uuml;rde</div><div>&nbsp;         das wohl niemand ernsthaft einsetzen.</div><div>&nbsp;       - `simple rules create realistic turn-taking patterns&#39;</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; SSJ rules as *generative* rules, not just descriptive. Shows</div><div>&nbsp;         that such a set of rules, together w/ some audio magic, are</div><div>&nbsp;         enough to produce patterns that are `natural&#39; (in a way that</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; needs to be defined properly). Again sort of upper-bound; to</div><div>&nbsp;         get something like this working properly within a real</div><div>&nbsp;         system, here&#39;s what we would need in terms of components.</div><div>&nbsp;         - to do first: b), d), e), g).</div><div><span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp; - needed: more principled metric for `naturalness&#39; of</div><div>&nbsp;           resulting corpus. Multi-dimensional: distribution of gaps</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   &amp; overlaps, balance btw speakers, turn length (in time,</div><div>&nbsp; <span class="Apple-tab-span"  style="white-space: pre">	</span>&nbsp;   but also # of utterances).</div><div>&nbsp;   - `syntactic and prosodic language modelling for incremental</div><div>&nbsp;     utterance segmentation&#39;, f&uuml;r Coling</div><div>&nbsp;     utterance end pointing, but in an incremental set up. Needed to</div><div>&nbsp;     know where to clear the chart of the parser. Connected to a</div><div>&nbsp;     well-researched task (i.e., easy to motivate &amp; compare), but</div><div>&nbsp;     different in that we don&#39;t allow (as much?) right context.</div><div>&nbsp;     - method:</div><div>&nbsp;     <span class="Apple-tab-span"  style="white-space: pre">	</span>- select only multi-utterance turns; EOUs to find are the</div><div>&nbsp;         turn-internal ones.</div><div>&nbsp;       - use original data &amp; variants </div><div>&nbsp;     - what&#39;s a good way to evaluate this? follow-on effects of wrong</div><div>&nbsp;       decisions: an insert for example makes us restart the parser,</div><div>&nbsp;       and hence get other things wrong?</div><div><br /></div>]]></description>
        </item>    </channel></rss>
