<node id="666776">
  <nid>666776</nid>
  <type>event</type>
  <uid>
    <user id="36172"><![CDATA[36172]]></user>
  </uid>
  <created>1679589241</created>
  <changed>1679593204</changed>
  <title><![CDATA[CSIP Seminar: Improving Fairness in Speaker Recognition and Speech Recognition]]></title>
  <body><![CDATA[<h3><strong>Center for Signals and Information Processing (CSIP)&nbsp;Seminar</strong></h3>

<p><strong>Date:</strong>&nbsp;Friday, March 31,&nbsp;2023</p>

<p><strong>Time:</strong>&nbsp;3:00 p.m. - 4:00 p.m. EST</p>

<p><strong>Location:&nbsp;</strong>Centergy Building 5126 and <a href="https://gatech.zoom.us/j/97276617058">virtual</a></p>

<p><strong>Speaker:&nbsp;</strong>Andreas Stolcke</p>

<p><strong>Speakers' Title:</strong>&nbsp;Senior Principal Scientist in the Alexa Speech organization at Amazon</p>

<p><strong>Seminar Title:&nbsp;</strong>Improving Fairness in Speaker Recognition and Speech Recognition</p>

<p><strong>Abstract:&nbsp;</strong>Group fairness, or avoidance of large performance disparities for different cohorts of users, is a major concern as AI technologies find adoption in ever more application scenarios. In this talk I will present some recent work on fairness for speech-based technologies, specifically, speaker recognition and speech recognition. For speaker recognition, I report on two algorithmic approaches to reduce performance variability across different groups. In the first method, group-adapted fusion, we combine sub-models that are specialized for subpopulations that have very different representation (and therefore performance) in the data. The second method, adversarial reweighting, forces the model to focus on those portions of the population that are harder to recognize, without requiring a priori labels for speaker groups. For automatic speech recognition, I present methods for detecting and mitigating accuracy disparities as a function of geographic or demographic variables, principally by oversampling or adaptation based on group membership. The talk concludes with an application of synthetic speech generation (TTS) for filling in data gaps for a group of speakers with atypical speech, namely, stutter.</p>

<p><strong>Speaker Bio:</strong>&nbsp;Andreas Stolcke is senior principal scientist in the Alexa Speech organization at Amazon. &nbsp;He obtained his PhD from UC Berkeley and then worked as a researcher at SRI International and Microsoft, before joining Amazon. His research interests include computational linguistics, language modeling, speech recognition, speaker recognition and diarization, and paralinguistics, &nbsp;with over three hundred papers and patents in these areas. His open-source SRI Language Modeling Toolkit was widely used in academia (before becoming obsolete by virtue of deep neural network models). Andreas is a Fellow of t he IEEE and the International Speech Communication Association and giving this talk as an IEEE Distinguished Industry Speaker.</p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Featuring Andreas Stolcke of Alexa Speech organization at Amazon]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>Andreas Stolcke,&nbsp;senior principal scientist in the Alexa Speech organization at Amazon,&nbsp;will present the CSIP Seminar, "Improving Fairness in Speaker Recognition and Speech Recognition."</p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2023-03-31T15:00:00-04:00]]></value>
      <value2><![CDATA[2023-03-31T16:00:00-04:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Faculty/Staff]]></value>
      </item>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
          <item>
        <value><![CDATA[Undergraduate students]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[<p>Kiran Kokilepersaud<br />
<a href="mailto:kpk6@gatech.edu">kpk6@gatech.edu</a><br />
&nbsp;</p>
]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>1255</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[School of Electrical and Computer Engineering]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1795</tid>
        <value><![CDATA[Seminar/Lecture/Colloquium]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>192224</tid>
        <value><![CDATA[CSIP Seminar]]></value>
      </item>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
