File size: 993 Bytes
2ddc64d
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
# Biweekly Feedback Summary

## 21.10.2022
1. [new dataset](https://huggingface.co/datasets/Adapting/abstract-keyphrases)
1. new model architecture:[KeyBartAdapter](https://github.com/Mondkuchen/idp_LiteratureResearch_Tool/blob/main/lrt/clustering/models/keyBartPlus.py)
    - [train script](https://github.com/Mondkuchen/idp_LiteratureResearch_Tool/blob/main/scripts/train/train.py)
    - [training result](https://huggingface.co/Adapting/KeyBartAdapter)
2. [文献分析平台比较](https://leoxiang66.github.io/LRT-Doc/4-%E6%96%87%E7%8C%AE%E5%88%86%E6%9E%90%E5%B9%B3%E5%8F%B0%E6%AF%94%E8%BE%83/)

## TODOs
- [x] a new keywords generation model
- [x] 比较其他文献工具
- [x] 开始研究query API
- [ ] 可视化
- [ ] 和其他文献工具比较结果
- [ ] 收集更多训练数据/fine-tune
- [ ] add clustering: using other clustering algorithms such as Gausian Mixture Model (GMM)
- [ ] add dimension reduction
- [ ] [better PLM](https://huggingface.co/spaces/mteb/leaderboard)