
Zhehao Zhang, a master’s student in computer science, co-developed a new dataset, FalseReject, to help language models better distinguish between truly harmful prompts and those that only seem risky. The project aims to reduce over-refusal behavior in AI systems without compromising safety. Read more on Unite.AI.