<node id="670909">
  <nid>670909</nid>
  <type>event</type>
  <uid>
    <user id="27707"><![CDATA[27707]]></user>
  </uid>
  <created>1699302882</created>
  <changed>1699987868</changed>
  <title><![CDATA[PhD Defense by Viraj Prabhu]]></title>
  <body><![CDATA[<p><span><span><strong><span><span>Title</span></span></strong><span><span>: Towards Reliable Computer Vision Systems</span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><strong><span><span>Date</span></span></strong><span><span>: Monday, November 20, 2023</span></span></span></span></p>

<p><span><span><strong><span><span>Time</span></span></strong><span><span>: 4:00-6:00pm (ET)</span></span></span></span></p>

<p><span><span><strong><span><span>Location: </span></span></strong><span><span>CODA C1115 (Druid Hills) &amp; </span></span><span><a href="https://gatech.zoom.us/j/92801380802?pwd=N3VWNXBRSGYyckVaRWlXaWVwSnZEdz09&amp;from=addon"><span>Zoom</span></a></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><strong><span><span>Viraj Prabhu</span></span></strong></span></span></p>

<p><span><span><span><span>PhD student in Computer Science</span></span></span></span></p>

<p><span><span><span><span>School of Interactive Computing</span></span></span></span></p>

<p><span><span><span><span>Georgia Institute of Technology</span></span></span></span></p>

<p><span><span>&nbsp;</span></span></p>

<p><span><span><strong><span><span>Committee</span></span></strong></span></span></p>

<p><span><span><span><span>Dr. Judy Hoffman (advisor), School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Dr. Dhruv Batra, School of Interactive Computing, Georgia Institute of Technology &amp; Meta</span></span></span></span></p>

<p><span><span><span><span>Dr. James Hays, School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Dr. Zsolt Kira, School of Interactive Computing, Georgia Institute of Technology</span></span></span></span></p>

<p><span><span><span><span>Dr. Sanja Fidler, University of Toronto &amp; NVIDIA</span></span></span></span></p>

<p>&nbsp;</p>

<p><span><span><strong><span><span>Abstract</span></span></strong></span></span></p>

<p><span><span><span><span>The real world has infinite visual variation – across viewpoints, time, space, and curation. As deep visual models become ubiquitous in high-stakes applications, their ability to generalize across such variation becomes increasingly important. Such generalization will alleviate the need to label a large corpus for every new deployment, which may be infeasible due to data volume (e.g., autonomous driving) or labeling cost (e.g., medical diagnosis). Further, it is necessary to overcome the natural spatiotemporal distribution shifts that a deployed model will invariably face (e.g., changing geographies and seasons). Finally, such generalization will unlock the possibility of knowledge transfer from inexpensive sources of data (e.g., transferring models trained in simulation to reality).&nbsp;</span></span></span></span></p>

<p><br />
<span><span><span><span>In this thesis, I will present opportunities to improve such generalization at different stages of the ML lifecycle. First, I will discuss <em>proactive</em> strategies for training robust models by leveraging simulation to augment the long tail of real training data. Next, I will present <em>reactive</em> strategies to recover from unforeseen distribution shifts via self-supervised domain adaptation. Finally, I will present a framework to <em>stress-test </em>the robustness of vision models by leveraging foundation models for text and image synthesis to generate challenging counterfactual test cases.</span></span></span></span></p>
]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[ Towards Reliable Computer Vision Systems]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p><span><span><span>&nbsp;Towards Reliable Computer Vision Systems</span></span></span></p>
]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2023-11-20T16:00:00-05:00]]></value>
      <value2><![CDATA[2023-11-20T18:00:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Coda C1115 and Zoom]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>221981</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[Graduate Studies]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
      </field_keywords>
  <userdata><![CDATA[]]></userdata>
</node>
