Spaces: feel-fl/open-human-feedback-chat

burtenshaw committed about 1 month ago

Commit 9cc6120

1 Parent(s): e9484c6

first refactored commit

Files changed (8)

README.md +40 -21
app/README.md +13 -0
app/app.py +205 -0
app/feedback.py +28 -0
ml/README.md +13 -0
ml/eval/kto_generations.json +0 -0
ml/eval/sft_generations.json +0 -0
pyproject.toml +18 -0

README.md CHANGED

@@ -1,39 +1,58 @@
-![Auto Assign](https://github.com/HF-RLHF-Platform/demo-repository/actions/workflows/auto-assign.yml/badge.svg)
-![Proof HTML](https://github.com/HF-RLHF-Platform/demo-repository/actions/workflows/proof-html.yml/badge.svg)
-# Welcome to your organization's demo respository
-This code repository (or "repo") is designed to demonstrate the best GitHub has to offer with the least amount of noise.
-The repo includes an `index.html` file (so it can render a web page), two GitHub Actions workflows, and a CSS stylesheet dependency.
-# Model-Improvement-Platform-With-RLHF
-Platform being developed at MIT in collaboration with HuggingFace. Aimed at improving performance of existing Large Language Models through real time human feedback loop.
-# HF-RLHF-Platform
 Platform being developed at MIT in collaboration with HuggingFace. Aimed at improving performance of existing Large Language Models through real-time human feedback loop.
 This repository hosts the development of an automated RLHF platform for Hugging Face, where the community can provide real-time feedback on language models. The feedback is automatically integrated into an RLHF pipeline to continuously fine-tune and improve the models.
-# The Feedback Collective
-**Open RLHF on VLMs for Students**
-A community-driven project to improve Vision-Language Models (VLMs) for student-focused tasks.
-Leverages feedback from users and automated RLHF pipelines to continuously improve model performance.
-## Dataset Schema for Project
-### KTO Dataset Structure
-The dataset should be organized into two splits: `train` and `test`.
-Each split contains the following features:
-| **Feature**   | **Type**  | **Description**                                                                      |
-|---------------|-----------|--------------------------------------------------------------------------------------|
-| `prompt`      | `string`  | The input text for the model. This should be a natural language query or input.      |
-| `completion`  | `string`  | The output text generated by the model in response to the `prompt`.                  |
-| `label`       | `bool`    | A binary value (`True` or `False`) indicating whether the `completion` is desirable. |

+---
+title: Feel
+emoji: 🚀
+colorFrom: blue
+colorTo: gray
+sdk: gradio
+sdk_version: 5.8.0
+app_file: app/app.py
+pinned: false
+---
+# Feel
+This is a project to create a continuous training application.
 Platform being developed at MIT in collaboration with HuggingFace. Aimed at improving performance of existing Large Language Models through real-time human feedback loop.
 This repository hosts the development of an automated RLHF platform for Hugging Face, where the community can provide real-time feedback on language models. The feedback is automatically integrated into an RLHF pipeline to continuously fine-tune and improve the models.
+## What is Feel?
+Feel is a community-driven project to improve multilingual Vision-Language Models (VLMs). It uses feedback from users and automated RLHF pipelines to continuously improve model performance.
+## Why Feel?
+Feel is a platform that enables the community to provide real-time feedback on language models. The feedback is automatically integrated into an RLHF pipeline to continuously fine-tune and improve the models.
+## Repository Structure
+The repository is organized as follows:
+```
+ml/                # Directory for machine learning code
+├── README.md      # Dataset schema and project structure
+├── data/          # Directory for dataset files
+└── models/        # Directory for model files
+app/               # Directory for application code
+└── app.py         # Main application file
+```
+## Installation
+The repository uses `uv` for managing virtual environments. To install `uv`, go [here](https://docs.astral.sh/uv/getting-started/installation/).
+To install the required dependencies, run the following commands:
+### ML Dependencies
+```bash
+uv sync --group ml
+```
+### App Dependencies
+```bash
+uv sync --group app
+```

app/README.md ADDED

@@ -0,0 +1,13 @@

+# Config
+```
+export HF_TOKEN=<your-token>
+export MODEL=<your-model-id> # https://huggingface.co/models?inference=warm&pipeline_tag=image-text-to-text&sort=trending
+export BASE_URL=<your-base-url> # https://hf-mirror.com/
+```
+# Run
+```
+python app.py
+```
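
Note on the configuration above: in `app/app.py` the client reads `MODEL` (not `MODEL_ID`) and passes a model ID only when `BASE_URL` is unset. A minimal sketch of that resolution as a pure function follows. The helper name `resolve_client_kwargs` is hypothetical and not part of this commit, and unlike the original it normalizes an empty `BASE_URL` to `None`.

```python
from typing import Mapping, Optional

# Default taken from app/app.py in this commit.
DEFAULT_MODEL = "meta-llama/Llama-3.2-11B-Vision-Instruct"


def resolve_client_kwargs(env: Mapping[str, str]) -> dict[str, Optional[str]]:
    """Mirror how app.py chooses between a hosted model and a custom endpoint.

    Hypothetical helper for illustration only.
    """
    base_url = env.get("BASE_URL") or None
    # A model ID is passed only when no base URL is configured.
    model = None if base_url else env.get("MODEL", DEFAULT_MODEL)
    return {"token": env.get("HF_TOKEN"), "model": model, "base_url": base_url}


# Example with a placeholder endpoint (not a real URL).
hosted = resolve_client_kwargs({"HF_TOKEN": "hf_xxx"})
custom = resolve_client_kwargs({"HF_TOKEN": "hf_xxx", "BASE_URL": "https://example.invalid"})
```

With this logic, setting `BASE_URL` silently overrides any `MODEL` value, which is worth keeping in mind when exporting both.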

app/app.py ADDED

@@ -0,0 +1,205 @@

+import os
+import uuid
+from base64 import b64encode
+from datetime import datetime
+from mimetypes import guess_type
+from pathlib import Path
+import gradio as gr
+from huggingface_hub import InferenceClient
+from pandas import DataFrame
+from feedback import save_feedback
+client = InferenceClient(
+    token=os.getenv("HF_TOKEN"),
+    model=(
+        os.getenv("MODEL", "meta-llama/Llama-3.2-11B-Vision-Instruct")
+        if not os.getenv("BASE_URL")
+        else None
+    ),
+    base_url=os.getenv("BASE_URL"),
+)
+def add_user_message(history, message):
+    for x in message["files"]:
+        history.append({"role": "user", "content": {"path": x}})
+    if message["text"] is not None:
+        history.append({"role": "user", "content": message["text"]})
+    return history, gr.MultimodalTextbox(value=None, interactive=False)
+def _format_history_as_messages(history: list):
+    messages = []
+    current_role = None
+    current_message_content = []
+    for entry in history:
+        content = entry["content"]
+        if entry["role"] != current_role:
+            if current_role is not None:
+                messages.append(
+                    {"role": current_role, "content": current_message_content}
+                )
+            current_role = entry["role"]
+            current_message_content = []
+        if isinstance(content, tuple):  # Handle file paths
+            for path in content:
+                data_uri = _convert_path_to_data_uri(path)
+                current_message_content.append(
+                    {"type": "image_url", "image_url": {"url": data_uri}}
+                )
+        elif isinstance(content, str):  # Handle text
+            current_message_content.append({"type": "text", "text": content})
+    if current_role is not None:
+        messages.append({"role": current_role, "content": current_message_content})
+    return messages
+def _convert_path_to_data_uri(path) -> str:
+    mime_type, _ = guess_type(path)
+    with open(path, "rb") as image_file:
+        data = image_file.read()
+        data_uri = f"data:{mime_type};base64," + b64encode(data).decode("utf-8")
+    return data_uri
+def _is_file_safe(path) -> bool:
+    try:
+        return Path(path).is_file()
+    except Exception:
+        return False
+def _process_content(content) -> str | list[str]:
+    if isinstance(content, str) and _is_file_safe(content):
+        return _convert_path_to_data_uri(content)
+    elif isinstance(content, list):
+        return _convert_path_to_data_uri(content[0])
+    return content
+def respond_system_message(history: list) -> list:
+    """Respond to the user message with an assistant message generated by the model"""
+    messages = _format_history_as_messages(history)
+    response = client.chat.completions.create(
+        messages=messages,
+        max_tokens=2000,
+        stream=False,
+    )
+    content = response.choices[0].message.content
+    # TODO: Add a response to the user message
+    message = gr.ChatMessage(role="assistant", content=content)
+    history.append(message)
+    return history
+def wrangle_like_data(x: gr.LikeData, history) -> tuple[list, DataFrame]:
+    """Wrangle conversations and liked data into a DataFrame"""
+    liked_index = x.index[0]
+    output_data = []
+    for idx, message in enumerate(history):
+        if idx == liked_index:
+            message["metadata"] = {"title": "liked" if x.liked else "disliked"}
+        rating = (message.get("metadata") or {}).get("title")
+        if rating == "liked":
+            message["rating"] = 1
+        elif rating == "disliked":
+            message["rating"] = -1
+        else:
+            message["rating"] = None
+        output_data.append(
+            {k: v for k, v in message.items() if k != "metadata"}
+        )
+    return history, DataFrame(data=output_data)
+def submit_conversation(dataframe, session_id):
+    """Submit the conversation to the dataset repo"""
+    if dataframe.empty:
+        gr.Info("No messages to submit because the conversation was empty")
+        return (gr.Dataframe(value=None, interactive=False), [])
+    dataframe["content"] = dataframe["content"].apply(_process_content)
+    conversation_data = {
+        "conversation": dataframe.to_dict(orient="records"),
+        "timestamp": datetime.now().isoformat(),
+        "session_id": session_id,
+        "conversation_id": str(uuid.uuid4()),
+    }
+    save_feedback(input_object=conversation_data)
+    gr.Info(f"Submitted {len(dataframe)} messages to the dataset")
+    return (gr.Dataframe(value=None, interactive=False), [])
+with gr.Blocks() as demo:
+    ##############################
+    # Chatbot
+    ##############################
+    session_id = gr.Textbox(
+        interactive=False,
+        value=str(uuid.uuid4()),
+        visible=False,
+    )
+    chatbot = gr.Chatbot(
+        elem_id="chatbot",
+        bubble_full_width=False,
+        type="messages",
+    )
+    chat_input = gr.MultimodalTextbox(
+        interactive=True,
+        file_count="multiple",
+        placeholder="Enter message or upload file...",
+        show_label=False,
+        submit_btn=True,
+    )
+    chat_msg = chat_input.submit(
+        fn=add_user_message, inputs=[chatbot, chat_input], outputs=[chatbot, chat_input]
+    )
+    bot_msg = chat_msg.then(
+        respond_system_message, chatbot, chatbot, api_name="bot_response"
+    )
+    bot_msg.then(lambda: gr.MultimodalTextbox(interactive=True), None, [chat_input])
+    ##############################
+    # Deal with feedback
+    ##############################
+    dataframe = gr.DataFrame()
+    chatbot.like(
+        fn=wrangle_like_data,
+        inputs=[chatbot],
+        outputs=[chatbot, dataframe],
+        like_user_message=False,
+    )
+    gr.Button(
+        value="Submit conversation",
+    ).click(
+        fn=submit_conversation,
+        inputs=[dataframe, session_id],
+        outputs=[dataframe, chatbot],
+    )
+    demo.load(
+        lambda: str(uuid.uuid4()),
+        inputs=[],
+        outputs=[session_id],
+    )
+demo.launch()
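
Note on `_format_history_as_messages`: it collapses consecutive history entries from the same role into one message whose `content` is a list of parts. A simplified, text-only sketch of that grouping follows. The name `merge_history` is hypothetical and image handling is left out, so this illustrates the idea and is not the code in this commit.

```python
def merge_history(history: list[dict]) -> list[dict]:
    """Group consecutive same-role text entries into one chat message."""
    messages: list[dict] = []
    for entry in history:
        part = {"type": "text", "text": entry["content"]}
        if messages and messages[-1]["role"] == entry["role"]:
            # Same speaker as the previous entry: extend that message.
            messages[-1]["content"].append(part)
        else:
            # Speaker changed: start a new message.
            messages.append({"role": entry["role"], "content": [part]})
    return messages


history = [
    {"role": "user", "content": "Hello"},
    {"role": "user", "content": "What is in this picture?"},
    {"role": "assistant", "content": "I cannot see a picture yet."},
]
messages = merge_history(history)
```

Here the two user entries end up as two parts of a single user message, followed by one assistant message.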

app/feedback.py ADDED

@@ -0,0 +1,28 @@

+import json
+import uuid
+from pathlib import Path
+from huggingface_hub import CommitScheduler
+APP_INSTANCE_ID = str(uuid.uuid4())
+feedback_file = Path("user_feedback/") / f"data_{APP_INSTANCE_ID}.json"
+feedback_folder = feedback_file.parent
+scheduler = CommitScheduler(
+    repo_id="ohp-test-conversation",
+    repo_type="dataset",
+    folder_path=feedback_folder,
+    path_in_repo="data",
+    every=1,
+)
+def save_feedback(input_object: dict) -> None:
+    """
+    Append input/outputs and user feedback to a JSON Lines file using a thread lock to avoid concurrent writes from different users.
+    """
+    with scheduler.lock:
+        with feedback_file.open(mode="a") as f:
+            f.write(json.dumps(obj=input_object))
+            f.write("\n")
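
Note on `save_feedback`: each call appends one JSON object per line while holding the scheduler's lock, so the file is JSON Lines even though it is named `.json`. A minimal stdlib sketch of the same append-and-read pattern follows. It assumes a plain `threading.Lock` as a stand-in for `scheduler.lock`, and the helper names `append_record` and `read_records` are hypothetical.

```python
import json
import tempfile
import threading
from pathlib import Path

lock = threading.Lock()  # stand-in for scheduler.lock in feedback.py


def append_record(path: Path, record: dict) -> None:
    """Append one JSON object as a single line, guarded by the lock."""
    with lock:
        with path.open(mode="a", encoding="utf-8") as f:
            f.write(json.dumps(record) + "\n")


def read_records(path: Path) -> list[dict]:
    """Read the file back, parsing each non-empty line as one JSON object."""
    with path.open(encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


with tempfile.TemporaryDirectory() as tmp:
    demo_path = Path(tmp) / "data_demo.json"
    append_record(demo_path, {"session_id": "a", "conversation": []})
    append_record(demo_path, {"session_id": "b", "conversation": []})
    records = read_records(demo_path)
```

Because every record sits on its own line, a reader can parse the file line by line instead of loading it with a single `json.load` call, which would fail on this layout.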

ml/README.md ADDED

@@ -0,0 +1,13 @@

+## Dataset Schema for Project
+### KTO Dataset Structure
+The dataset should be organized into two splits: `train` and `test`.
+Each split contains the following features:
+| **Feature**   | **Type**  | **Description**                                                                      |
+|---------------|-----------|--------------------------------------------------------------------------------------|
+| `prompt`      | `string`  | The input text for the model. This should be a natural language query or input.      |
+| `completion`  | `string`  | The output text generated by the model in response to the `prompt`.                  |
+| `label`       | `bool`    | A binary value (`True` or `False`) indicating whether the `completion` is desirable. |
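
Note on the schema above: the commit does not define how submitted conversations map onto `prompt`, `completion`, and `label`. One possible mapping is sketched below. It assumes the record layout produced by `submit_conversation` (`role`, `content`, and a `rating` of `1`, `-1`, or `None`) and pairs each rated assistant message with the most recent user text. The function name `conversation_to_kto_rows` is hypothetical.

```python
def conversation_to_kto_rows(conversation: list[dict]) -> list[dict]:
    """Turn rated assistant messages into rows with prompt/completion/label.

    Assumption: the prompt is the latest plain-text user message. Image
    data URIs and unrated assistant messages are skipped.
    """
    rows: list[dict] = []
    last_user_text = None
    for message in conversation:
        role = message.get("role")
        content = message.get("content")
        if not isinstance(content, str):
            continue
        if role == "user" and not content.startswith("data:"):
            last_user_text = content
        elif role == "assistant" and message.get("rating") in (1, -1):
            if last_user_text is not None:
                rows.append(
                    {
                        "prompt": last_user_text,
                        "completion": content,
                        "label": message["rating"] == 1,
                    }
                )
    return rows


conversation = [
    {"role": "user", "content": "What is 2 + 2?", "rating": None},
    {"role": "assistant", "content": "4", "rating": 1},
    {"role": "user", "content": "And 3 + 3?", "rating": None},
    {"role": "assistant", "content": "7", "rating": -1},
    {"role": "user", "content": "Thanks", "rating": None},
    {"role": "assistant", "content": "You're welcome", "rating": None},
]
rows = conversation_to_kto_rows(conversation)
```

Only the two rated assistant replies become rows. The liked one gets `label=True` and the disliked one `label=False`, matching the `bool` type in the table.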

ml/eval/kto_generations.json DELETED

The diff for this file is too large to render. See raw diff

ml/eval/sft_generations.json DELETED

The diff for this file is too large to render. See raw diff

pyproject.toml ADDED

@@ -0,0 +1,18 @@

+[project]
+name = "ohp"
+version = "0.1.0"
+description = "A human feedback project"
+readme = "README.md"
+requires-python = ">=3.11"
+dependencies = [
+    "datasets>=3.1.0",
+]
+[dependency-groups]
+ml = [
+    "trl>=0.12.2",
+]
+app = [
+    "gradio>=5.8.0",
+    "huggingface-hub>=0.26.5",
+]