<node id="686091">
  <nid>686091</nid>
  <type>event</type>
  <uid>
    <user id="28475"><![CDATA[28475]]></user>
  </uid>
  <created>1761768939</created>
  <changed>1761769021</changed>
  <title><![CDATA[Ph.D. Dissertation Defense - Pin-Jui Ku]]></title>
  <body><![CDATA[<p><strong>Title</strong><em>:&nbsp; Incorporating Geometric and Consistency Constraints with Deep Models for Robust Phase Reconstruction and Speech Enhancement</em></p><p><strong>Committee:</strong></p><p>Dr.&nbsp;Chin-Hui Lee, ECE, Chair, Advisor</p><p>Dr.&nbsp;Larry Heck, ECE</p><p>Dr.&nbsp;David Anderson, ECE</p><p>Dr.&nbsp;Elliot Moore, ECE</p><p>Dr.&nbsp;Marco Sabato Siniscalchi, U of Palermo</p>]]></body>
  <field_summary_sentence>
    <item>
      <value><![CDATA[Incorporating Geometric and Consistency Constraints with Deep Models for Robust Phase Reconstruction and Speech Enhancement ]]></value>
    </item>
  </field_summary_sentence>
  <field_summary>
    <item>
      <value><![CDATA[<p>This dissertation proposes a new framework for phase estimation that overcomes these limitations and demonstrates its effectiveness across multiple DNN-based SE models. We begin by introducing the first deep state-space-based SE model operating on complex-valued spectrograms. While it surpasses baseline models with a compact U-Net architecture, its estimated phase offers limited improvement over the noisy phase, underscoring the difficulty of direct phase prediction. To address this, we develop a novel explicity consistency-preserving loss that leverages the observation that perceptually high-quality speech arises when magnitude and phase are mutually consistent. Building on this insight, we integrate geometric constraints under additive noise conditions with the consistency principle, resulting in the Multi-Sourced Griffin-Lim Algorithm (MSGLA). MSGLA jointly refines speech and noise phases through iterative updates guided by DNN-estimated magnitudes and geometric relationships, outperforming direct phase estimation and prior geometric methods. Finally, we extend these ideas to a large-scale generative pretraining framework that models the distribution of clean speech spectrograms and incorporates the consistency-based phase loss during training. &nbsp;</p>]]></value>
    </item>
  </field_summary>
  <field_time>
    <item>
      <value><![CDATA[2025-11-19T08:30:00-05:00]]></value>
      <value2><![CDATA[2025-11-19T10:30:00-05:00]]></value2>
      <rrule><![CDATA[]]></rrule>
      <timezone><![CDATA[America/New_York]]></timezone>
    </item>
  </field_time>
  <field_fee>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_fee>
  <field_extras>
      </field_extras>
  <field_audience>
          <item>
        <value><![CDATA[Public]]></value>
      </item>
      </field_audience>
  <field_media>
      </field_media>
  <field_contact>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_contact>
  <field_location>
    <item>
      <value><![CDATA[Room 530, TSRB]]></value>
    </item>
  </field_location>
  <field_sidebar>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_sidebar>
  <field_phone>
    <item>
      <value><![CDATA[]]></value>
    </item>
  </field_phone>
  <field_url>
    <item>
      <url><![CDATA[]]></url>
      <title><![CDATA[]]></title>
            <attributes><![CDATA[]]></attributes>
    </item>
  </field_url>
  <field_email>
    <item>
      <email><![CDATA[]]></email>
    </item>
  </field_email>
  <field_boilerplate>
    <item>
      <nid><![CDATA[]]></nid>
    </item>
  </field_boilerplate>
  <links_related>
      </links_related>
  <files>
      </files>
  <og_groups>
          <item>434381</item>
      </og_groups>
  <og_groups_both>
          <item><![CDATA[ECE Ph.D. Dissertation Defenses]]></item>
      </og_groups_both>
  <field_categories>
          <item>
        <tid>1788</tid>
        <value><![CDATA[Other/Miscellaneous]]></value>
      </item>
      </field_categories>
  <field_keywords>
          <item>
        <tid>100811</tid>
        <value><![CDATA[Phd Defense]]></value>
      </item>
          <item>
        <tid>1808</tid>
        <value><![CDATA[graduate students]]></value>
      </item>
      </field_keywords>
  <field_userdata><![CDATA[]]></field_userdata>
</node>
