Text-to-Speech
BERTopic
English
Not-For-All-Audiences
code
medical
text-generation-inference
Mixture of Experts
cannabis
TheGanjaGuru / README.md
PhoenixBlaze420's picture
Update README.md
0ae86e8 verified
metadata
license: apache-2.0
datasets:
  - HuggingFaceFW/fineweb-2
language:
  - en
metrics:
  - accuracy
  - character
base_model:
  - Qwen/Qwen2.5-Coder-32B-Instruct
  - meta-llama/Llama-3.3-70B-Instruct
new_version: meta-llama/Llama-3.3-70B-Instruct
library_name: bertopic
tags:
  - not-for-all-audiences
  - code
  - medical
  - text-generation-inference
  - moe
  - cannabis
pipeline_tag: text-to-speech

Model Card for Model ID

This modelcard aims to be a base template for new models. It has been generated using this raw template.

Model Details

Model Description

  • Developed by: [More Information Needed]
  • Funded by [optional]: [More Information Needed]
  • Shared by [optional]: [More Information Needed]
  • Model type: [More Information Needed]
  • Language(s) (NLP): [More Information Needed]
  • License: [More Information Needed]
  • Finetuned from model [optional]: [More Information Needed]

Model Sources [optional]

  • Repository: [More Information Needed]
  • Paper [optional]: [More Information Needed]
  • Demo [optional]: [More Information Needed]

Uses

Direct Use

The GanjaGuru can be used as an AI-powered virtual assistant for cannabis enthusiasts, growers, and businesses. It provides recommendations on products, cultivation techniques, and grow room design, while also assisting with marketing, sales, and delivery optimization. Users can interact directly with the model for personalized guidance and expert advice.

Downstream Use [optional]

When integrated into a larger ecosystem, such as an e-commerce platform or a cannabis community application, The GanjaGuru can support advanced functionalities like IoT connectivity for smart grow systems, AR/VR-powered shopping experiences, and automated customer support for cannabis-related queries.

Out-of-Scope Use

The GanjaGuru is not suitable for medical advice, non-cannabis-related industries, or applications requiring legal compliance without proper regulation checks. It should not be used for illegal activities, misinformation, or tasks outside its expertise.

Bias, Risks, and Limitations

The GanjaGuru, like any advanced AI system, is subject to certain biases, risks, and limitations:

  • Bias in Recommendations: The model may inadvertently favor products or services based on dataset limitations or biases in the training data. Regular audits and updates are required to ensure fairness and inclusivity.
  • Technical Limitations: The model's accuracy may degrade with outdated data or in scenarios requiring nuanced understanding of user needs.
  • Regulatory Risks: Operating in the cannabis industry involves strict legal compliance, and inaccuracies in product or cultivation advice could lead to costly violations.
  • User Misuse: There's potential for misuse in scenarios where users attempt to obtain non-legitimate advice or circumvent legal restrictions.

[More Information Needed]

Recommendations

  • Conduct routine audits and updates of the training datasets to mitigate biases and maintain accuracy.
  • Implement transparency features that allow users to understand how recommendations are made.
  • Educate users about the limitations and appropriate use of the GanjaGuru.
  • Integrate a feedback loop for continuous improvement based on real-world interactions.

[More Information Needed for further recommendations]

How to Get Started with the Model

Use the following guidelines and resources to implement the GanjaGuru effectively:

[More Information Needed]

Training Details

Training Data

The GanjaGuru's training data comprises comprehensive datasets focused on cannabis-related topics, including cultivation techniques, product recommendations, legal compliance, and consumer preferences. The data integrates information from scientific research, product catalogs, and user-generated insights to ensure a balanced understanding of the cannabis ecosystem. Documentation related to data preprocessing, additional filtering, and dataset cards is still under development.

[More Information Needed]

Training Procedure

The training process involves fine-tuning advanced AI models with a focus on natural language understanding and contextual accuracy specific to the cannabis industry. The procedure integrates iterative learning, bias mitigation, and domain-specific customization to optimize performance.

Preprocessing [Optional]

Preprocessing steps include data normalization, tokenization, and cleaning to ensure consistency and relevance across the datasets used. Enhanced techniques like data augmentation and feature engineering are applied to improve robustness and adaptability.

[More Information Needed]

Training Hyperparameters

  • Training regime: [More Information Needed]

Speeds, Sizes, Times [optional]

[More Information Needed]

Evaluation

Testing Data, Factors & Metrics

Testing Data

[More Information Needed]

Factors

[More Information Needed]

Metrics

[More Information Needed]

Results

[More Information Needed]

Summary

Model Examination [optional]

[More Information Needed]

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).

  • Hardware Type: [More Information Needed]
  • Hours used: [More Information Needed]
  • Cloud Provider: [More Information Needed]
  • Compute Region: [More Information Needed]
  • Carbon Emitted: [More Information Needed]

Technical Specifications [optional]

Model Architecture and Objective

[More Information Needed]

Compute Infrastructure

[More Information Needed]

Hardware

[More Information Needed]

Software

[More Information Needed]

Citation [optional]

BibTeX:

[More Information Needed]

APA:

[More Information Needed]

Glossary [optional]

[More Information Needed]

More Information [optional]

[More Information Needed]

Model Card Authors [optional]

[More Information Needed]

Model Card Contact

[More Information Needed]