--- license: apache-2.0 tags: - inbora studio - drchamyoung - Neural Network - DLL - Deep ONNX - Behaviour Agents --- # Xieral Code Gen 3B Xieral Code Gen 3B is a decoder-only language model with 2.7 billion parameters. Developed from the `Xieral-Code-Gen-3b`, this model is designed specifically for code generation and software engineering tasks. ## Model Overview - **Architecture**: Decoder-only language model - **Parameters**: 2.7 billion - **Training Data**: Combination of publicly available and synthetic datasets - **Optimization**: Direct Preference Optimization (DPO) - **Fine-tuning**: General code/software engineering conversations, SQL query generation, and discussion ## Performance Xieral Code Gen 3B has demonstrated competitive performance compared to other models of similar size: - **MultiPL-E Metrics**: Evaluated across various programming languages using the BigCode Evaluation Harness. - **MT Bench**: Shows strong results on code-related tasks. ## Usage This model is well-suited for: - General code/software engineering conversations - SQL query generation and discussion ### Requirements To run Xieral Code Gen 3B locally, you will need: - **VRAM**: 8GB+ (Graphics card with sufficient VRAM) - **Dependencies**: Ensure you have the necessary libraries and environment set up to run the model. ## Installation To install the required dependencies, use: ```bash pip install -r requirements.txt