Leveraging Offline Public Data in Online Differentially Private Policy Fine-Tuning (Prof. Sayak Chowdhury, Computer Science & Engineering)
Modern machine learning models often train on offline data and then learn from online user interactions, raising privacy concerns, especially during fine-tuning stages that involve sensitive data. Differential Privacy (DP) mitigates these risks by adding noise during training, though this can hurt accuracy. Leveraging offline public data can reduce this privacy-utility trade-off. This project aims to design DP-compliant bandit and reinforcement learning algorithms that exploit such data, with theoretical performance guarantees, and to compare them against purely offline and purely online baselines. It also seeks to develop DP policy fine-tuning methods for aligning large language models, ultimately enabling privacy-preserving, trustworthy AI systems such as secure chatbots.
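To make the idea concrete, the following is a minimal illustrative sketch (not the project's actual algorithm) of a differentially private multi-armed bandit that is warm-started from public offline estimates. Laplace noise calibrated to the reward sensitivity is added to the released per-arm statistics, and the public offline means initialize the estimates so less exploration is needed on sensitive online data. The function name `private_ucb`, the arm means, and the simple per-round noise addition are all assumptions for illustration; a practical DP bandit would use a tighter accounting scheme such as the tree-based aggregation mechanism.

```python
import numpy as np

rng = np.random.default_rng(0)

def private_ucb(offline_means, true_means, T=2000, eps=1.0):
    """Illustrative eps-DP-style UCB bandit warm-started from public data.

    offline_means: public (non-sensitive) per-arm reward estimates.
    true_means:    ground-truth Bernoulli reward probabilities (simulation only).
    Returns the number of times each arm was pulled.
    """
    K = len(true_means)
    # Warm start: treat each arm as if pulled once with its public offline mean.
    sums = np.array(offline_means, dtype=float)
    counts = np.ones(K)
    for t in range(1, T + 1):
        # Release noisy means: Laplace noise with scale 1/eps protects the
        # per-arm reward sums (rewards lie in [0, 1], so sensitivity is 1).
        noisy_means = (sums + rng.laplace(0.0, 1.0 / eps, K)) / counts
        # Standard UCB exploration bonus on top of the noisy estimates.
        ucb = noisy_means + np.sqrt(2.0 * np.log(t + 1) / counts)
        arm = int(np.argmax(ucb))
        reward = rng.binomial(1, true_means[arm])  # sensitive online feedback
        sums[arm] += reward
        counts[arm] += 1
    return counts

# Hypothetical example: public offline estimates roughly track the true means.
pulls = private_ucb(offline_means=[0.25, 0.45, 0.75],
                    true_means=[0.3, 0.5, 0.7])
```

Despite the injected noise, the per-arm noise on the estimated means shrinks as counts grow, so the best arm is eventually pulled most often; better offline public estimates shorten the noisy exploration phase, which is the trade-off this project studies formally.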