Dec, 2024 | Attending NeurIPS ‘24 to present our work at the Pluralistic Alignment workshop. |
Dec, 2024 | Granite Guardian 3.0 technical report is now out! |
Nov, 2024 | I’ll be presenting our work “Value Alignment From Unstructured Text” at EMNLP 2024 |
Oct, 2024 | Granite Guardian 3.0 is out! It helps detect input and response risks, including various harm and RAG hallucinations. |
Sep, 2024 | CAST: Checkout my exceptional summer intern, Bruce Lee’s, work on conditional activation steering. |
Aug, 2024 | Alignment Studio is accepted to IEEE Internet Computing! We introduce an architecture that facilitates alignment of LMs to specific values, norms and regulations within a context. |