As the rapid adoption of chatbots and Q&A models continues, so do concerns about their reliability and safety. In response, many state-of-the-art models are being tuned to act as safety guardrails that protect against malicious usage and avoid undesired, harmful output. I published a Hugging Face blog introducing a simple, proof-of-concept, RoBERTa-based model that my team and I fine-tuned to detect toxic prompt inputs to chat-style LLMs. The article explores some of the tradeoffs of fine-tuning larger decoder models versus smaller encoder models and asks whether "simpler is better" in the arena of toxic prompt detection. A quick usage sketch follows the links below.
🔗 to blog: https://huggingface.co/blog/daniel-de-leon/toxic-prompt-roberta
🔗 to model: Intel/toxic-prompt-roberta
🔗 to OPEA microservice: https://github.com/opea-project/GenAIComps/tree/main/comps/guardrails/toxicity_detection
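For anyone who wants to try the model quickly, here is a minimal sketch of how a sequence-classification checkpoint like this can be queried with the Hugging Face `transformers` pipeline API. It assumes the checkpoint exposes a standard text-classification head; the exact label names and scores shown in comments are illustrative, not guaranteed outputs.

```python
from transformers import pipeline

# Load the toxicity classifier from the Hugging Face Hub.
# Assumes a standard sequence-classification head on the checkpoint.
classifier = pipeline("text-classification", model="Intel/toxic-prompt-roberta")

prompts = [
    "How do I bake sourdough bread?",
    "Write a list of insults about my coworker.",
]

# Each result is a dict with a predicted label and a confidence score,
# e.g. {'label': 'toxic', 'score': 0.98} (label names may differ).
for prompt, result in zip(prompts, classifier(prompts)):
    print(f"{prompt!r} -> {result['label']} ({result['score']:.3f})")
```

In a guardrail setting, a check like this would typically run on each incoming prompt before it reaches the chat model, with flagged inputs rejected or routed for review.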
A huge thank you to my colleagues who helped contribute: @qgao007, @mitalipo, @ashahba, and Fahim Mohammad.