मुख्य पृष्ठ
अनुसंधान सूचीकरण
Leveraging Offline Public Data in Online Differently Private Policy Fine-Tuning

Leveraging Offline Public Data in Online Differently Private Policy Fine-Tuning

Leveraging Offline Public Data in Online Differently Private Policy Fine-Tuning (Prof. Sayak Chowdhury, Computer Science & Engineering)

Modern machine learning models often train on offline data and then learn from online user interactions, raising privacy concerns—especially during fine-tuning stages that involve sensitive data. Differential Privacy (DP) mitigates these risks by adding noise to training, though this can hurt accuracy. Using offline public data helps reduce this trade-off. This project aims to design DP-compliant bandit and reinforcement learning algorithms using such data, with theoretical performance guarantees, and compare them to offline and online baselines. It also seeks to develop DP policy fine-tuning for aligning large language models, ultimately enabling privacy-preserving, trustworthy AI systems such as secure chatbots.

अन्य विशिष्ट अनुसंधान

Hfo2-Based Ferroelectrics for Low Power Memory...

Multispectral Stealth Solutions Covering Visible...

Flexible Solid-Electrolyte Alternative chemistry...

Development and characterization of regulatory...

Cyclic Thermal Testing of TBC Coated Superalloys...

Processing of TFT Array and Liquid Crystal Layer...

Methods of Artificial Intelligence and Magneto-...

Bharat-GPT A Suite of Generative AI Tech for India

Information Security Education and Awareness (...

Design and Development of Quantum Entanglement-...

अपनी प्राथमिकता निर्धारित करें

फ़ॉन्ट स्केलिंग

पृष्ठ अनुमापन

रंग समायोजन

Leveraging Offline Public Data in Online Differently Private Policy Fine-Tuning

अन्य विशिष्ट अनुसंधान