Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
tsinghua-ee
/
SALMONN
like
44
Follow
Electronic Engineering @Tsinghua University
10
Automatic Speech Recognition
PyTorch
English
automatic-audio-captioning
automatic-speech-translation
music-captioning
audio-based-storytelling
speech-audio-coreasoning
auditory understanding
arxiv:
2310.13289
arxiv:
2406.15704
License:
apache-2.0
Model card
Files
Files and versions
Community
2
refs/pr/2
SALMONN
5 contributors
History:
32 commits
AdinaY
HF staff
Add paper link
3557781
verified
5 months ago
beats
chore: release v1
about 1 year ago
other_third-party_licenses
chore: release v1
about 1 year ago
qformer
chore: release v1
about 1 year ago
resource
chore: release v1
about 1 year ago
.gitattributes
Safe
56 Bytes
chore: release v1
about 1 year ago
.gitignore
Safe
3.1 kB
chore: release v1
about 1 year ago
LICENSE
Safe
11.3 kB
chore: release v1
about 1 year ago
README.md
Safe
6.08 kB
Add paper link
5 months ago
cli_inference.py
Safe
1.98 kB
chore: add lora alpha
about 1 year ago
model.py
Safe
9.79 kB
chore: release v1
about 1 year ago
requirements.txt
Safe
160 Bytes
Create requirements.txt
about 1 year ago
salmonn_v1.pth
Safe
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"torch.LongStorage"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.FloatStorage"
What is a pickle import?
400 MB
LFS
Upload salmonn_v1.pth
about 1 year ago
web_demo.py
Safe
7.32 kB
chore: change sac prompot
about 1 year ago