{"668479":{"#nid":"668479","#data":{"type":"event","title":"PhD Proposal by Ashutosh Baheti","body":[{"value":"\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003E\u003Cspan\u003ETitle:\u0026nbsp;\u003C\/span\u003E\u003C\/span\u003E\u003C\/strong\u003E\u003Cspan\u003E\u003Cspan\u003ETOWARDS FINE-GRAINED MULTI-ATTRIBUTE CONTROL USING LANGUAGE MODELS\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cbr \/\u003E\r\n\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003EDate:\u003C\/span\u003E\u003C\/strong\u003E\u003Cspan\u003E\u0026nbsp;\u003Cspan\u003EFriday,\u0026nbsp;\u003C\/span\u003E21\u003C\/span\u003Est\u0026nbsp;July, 2023\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003Cbr \/\u003E\r\n\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003ETime\u003C\/span\u003E\u003C\/strong\u003E\u003Cspan\u003E: \u003C\/span\u003E\u003Cspan\u003E\u003Cspan\u003E\u0026nbsp;11:30 AM to 1:30 PM ET\u0026nbsp; \u0026nbsp; \u0026nbsp;(\u003C\/span\u003E\u003C\/span\u003E\u003Cspan\u003E8:30 AM - 10:30 AM PT)\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003ELocation: \u003C\/span\u003E\u003C\/strong\u003E\u003Cspan\u003EVirtual | Zoom link -\u003C\/span\u003E\u0026nbsp;\u003Ca href=\u0022https:\/\/gatech.zoom.us\/j\/93320367440?pwd=REhiamxVREcwdUF5Z21XTXJ1NmFWUT09\u0026amp;from=addon\u0022\u003E\u003Cspan\u003E\u003Cspan\u003Ehttps:\/\/gatech.zoom.us\/j\/93320367440?pwd=REhiamxVREcwdUF5Z21XTXJ1NmFWUT09\u0026amp;from=addon\u003C\/span\u003E\u003C\/span\u003E\u003C\/a\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003EAshutosh Baheti\u003C\/span\u003E\u003C\/strong\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EPhD student\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003ESchool of Interactive Computing\u003Cbr \/\u003E\r\nGeorgia Institute of Technology\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003ECommittee:\u003C\/span\u003E\u003C\/strong\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EProf. Mark Riedl (Advisor) -- School of Interactive Computing, Georgia Institute of Technology\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EProf. Alan Ritter (Co-Advisor)\u0026nbsp;-- School of Interactive Computing, Georgia Institute of Technology\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EProf. Dhruv Batra\u0026nbsp;-- School of Interactive Computing, Georgia Institute of Technology\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EProf. Munmun de Choudhury\u0026nbsp;-- School of Interactive Computing, Georgia Institute of Technology\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EProf. Maarten Sap --\u0026nbsp;Language Technologies Institute, Carnegie Mellon University\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cstrong\u003E\u003Cspan\u003EAbstract\u003C\/span\u003E\u003C\/strong\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003ERecent advancements in pretraining large language models have resulted in their remarkable ability to generate complex and human-proficient language. Consequently, these models have gained widespread adoption as complex problem-solving chatbots and writing assistants. However, as we increasingly rely on these powerful language models, ensuring their safe and effective operation necessitates extensive research in controllable text generation. Existing methods manipulate the decoding process, use data augmentation or online reinforcement learning methods to encourage models to generate responses with the desired attributes. However, even the state-of-the-art language models struggle to generate the most accurate or desired output at the first attempt. Inspired by recent developments in self-correction in large language models and new reinforcement learning methods, we aim to train smaller language models as fine-grained editors, whereby they iteratively edit outputs to satisfy threshold constraints over multiple classifier-based attributes.\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003EIn this thesis, I show preliminary work to incorporate per-token distributional constraints during decoding and improve the generation quality of traditional LSTM-based dialog models. Later, I show a study of contextual offensive behavior of pretrained large language models and curate a high-quality dataset for toxicity detection. We also experiment with preliminary controlled text generation methods to decrease the dialog model\u0027s toxicity and agreement in offensive contexts. Next, I introduce a novel offline RL algorithm that can utilize arbitrary numeric scores as rewards during training to optimize any user-desired LM behavior. Building on this offline RL framework, I propose a fine-grained multi-attribute controllability task, where the goal is to guide the language model to generate output sequences that satisfy user-defined threshold-based attribute constraints. We frame the problem as an editing game, where the language model can take multiple edits to reach the desired attributes. Interestingly, our method uses Offline RL to cheaply train LM editors without any exploration.\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n","summary":"","format":"limited_html"}],"field_subtitle":"","field_summary":[{"value":"\u003Cp\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003E\u003Cspan\u003ETOWARDS FINE-GRAINED MULTI-ATTRIBUTE CONTROL USING LANGUAGE MODELS\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/span\u003E\u003C\/p\u003E\r\n","format":"limited_html"}],"field_summary_sentence":[{"value":"TOWARDS FINE-GRAINED MULTI-ATTRIBUTE CONTROL USING LANGUAGE MODELS"}],"uid":"27707","created_gmt":"2023-07-14 15:30:26","changed_gmt":"2023-07-14 15:30:26","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2023-07-21T11:30:00-04:00","event_time_end":"2023-07-21T13:30:00-04:00","event_time_end_last":"2023-07-21T13:30:00-04:00","gmt_time_start":"2023-07-21 15:30:00","gmt_time_end":"2023-07-21 17:30:00","gmt_time_end_last":"2023-07-21 17:30:00","rrule":null,"timezone":"America\/New_York"},"location":"REMOTE","extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"102851","name":"Phd proposal"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78771","name":"Public"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}