werelax
's Collections
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
•
2310.09263
•
Published
•
39
A Zero-Shot Language Agent for Computer Control with Structured
Reflection
Paper
•
2310.08740
•
Published
•
14
The Consensus Game: Language Model Generation via Equilibrium Search
Paper
•
2310.09139
•
Published
•
12
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper
•
2310.09199
•
Published
•
24
CodeChain: Towards Modular Code Generation Through Chain of
Self-revisions with Representative Sub-modules
Paper
•
2310.08992
•
Published
•
10
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with
Refined Data Generation
Paper
•
2312.14187
•
Published
•
49
Reasons to Reject? Aligning Language Models with Judgments
Paper
•
2312.14591
•
Published
•
17
Exploiting Novel GPT-4 APIs
Paper
•
2312.14302
•
Published
•
12
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Paper
•
2312.14233
•
Published
•
15
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Paper
•
2312.14385
•
Published
•
5
Shai: A large language model for asset management
Paper
•
2312.14203
•
Published
•
4
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Paper
•
2312.14878
•
Published
•
13
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for
Single Image Talking Face Generation
Paper
•
2312.13578
•
Published
•
27
Generative Multimodal Models are In-Context Learners
Paper
•
2312.13286
•
Published
•
34
Mini-GPTs: Efficient Large Language Models through Contextual Pruning
Paper
•
2312.12682
•
Published
•
8
LLM in a flash: Efficient Large Language Model Inference with Limited
Memory
Paper
•
2312.11514
•
Published
•
258
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided
Document Generation
Paper
•
2312.11532
•
Published
•
5
ProTIP: Progressive Tool Retrieval Improves Planning
Paper
•
2312.10332
•
Published
•
7
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
•
2312.10003
•
Published
•
35
Self-Evaluation Improves Selective Generation in Large Language Models
Paper
•
2312.09300
•
Published
•
14
Extending Context Window of Large Language Models via Semantic
Compression
Paper
•
2312.09571
•
Published
•
12
Challenges with unsupervised LLM knowledge discovery
Paper
•
2312.10029
•
Published
•
7
Faithful Persona-based Conversational Dataset Generation with Large
Language Models
Paper
•
2312.10007
•
Published
•
6
Perspectives on the State and Future of Deep Learning -- 2023
Paper
•
2312.09323
•
Published
•
5
Zebra: Extending Context Window with Layerwise Grouped Local-Global
Attention
Paper
•
2312.08618
•
Published
•
11
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Paper
•
2306.08568
•
Published
•
28
WizardMath: Empowering Mathematical Reasoning for Large Language Models
via Reinforced Evol-Instruct
Paper
•
2308.09583
•
Published
•
7
Blending Is All You Need: Cheaper, Better Alternative to
Trillion-Parameters LLM
Paper
•
2401.02994
•
Published
•
47
MoE-Mamba: Efficient Selective State Space Models with Mixture of
Experts
Paper
•
2401.04081
•
Published
•
70
CogAgent: A Visual Language Model for GUI Agents
Paper
•
2312.08914
•
Published
•
29
ORPO: Monolithic Preference Optimization without Reference Model
Paper
•
2403.07691
•
Published
•
62
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real
Computer Environments
Paper
•
2404.07972
•
Published
•
44
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Paper
•
2404.07413
•
Published
•
36
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training
Paper
•
2405.06932
•
Published
•
16