Changelogs
2023-11-11
- Added new OCR model 48px
2023-05-08
- Added 4x-UltraSharp upscaler
2023-04-30
- Countless bug fixes and refactoring
- Added CRAFT detector, enable it with --detector craft
2022-06-15
- Added new inpainting model LaMa MPE by dmMaze and set it as the default
2022-04-23
Project version is now at beta-0.3
- Added English text renderer by dmMaze
- Added new CTC-based OCR engine with a significant speed improvement
- The new OCR model now supports Korean
2022-03-19
- Use new font rendering method by pokedexter
- Added manual translation UI by rspreet92
2022-01-24
- Added text detection model by dmMaze
2021-08-21
- New MST-based text region merge algorithm, a huge improvement in text region merging
- Add Baidu translator in demo mode
- Add Google translator in demo mode
- Various bugfixes
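
The MST-based region merge mentioned for 2021-08-21 can be pictured with a small sketch. This is not the project's implementation: it assumes each text region is reduced to a center point, builds a minimum spanning tree with Kruskal's algorithm, and leaves out every edge longer than a hypothetical max_gap threshold, so each remaining tree becomes one merged region. The function name and parameters are made up for illustration.

```python
import math


def merge_regions_mst(centers, max_gap):
    """Group points via a Kruskal MST whose edges longer than max_gap are cut.

    Illustrative only: `centers` and `max_gap` are assumed inputs, not
    names taken from the project.
    """
    n = len(centers)
    parent = list(range(n))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    # All pairwise edges, shortest first (Kruskal's processing order).
    edges = sorted(
        (math.dist(centers[a], centers[b]), a, b)
        for a in range(n)
        for b in range(a + 1, n)
    )
    for length, a, b in edges:
        if length > max_gap:
            break  # every later edge is at least this long, so it stays cut
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b  # accept the edge into the spanning forest

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


if __name__ == "__main__":
    boxes = [(0, 0), (1, 0), (10, 0), (11, 0)]
    print(merge_regions_mst(boxes, max_gap=2))
```

A real merger would likely also weigh font size, direction and box geometry rather than center distance alone.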
2021-07-29
- Web demo adds translator, detection resolution and target language options
- Slight text color extraction improvement
2021-07-26
Major upgrades for all components; we are now in beta!
Note: in this version all English text is detected as capital letters.
You need Python >= 3.8 for cached_property to work.
- Detection model upgrade
- OCR model upgrade, better at text color extraction
- Inpainting model upgrade
- Major text rendering improvement, faster rendering and higher quality text with shadow
- Slight mask generation improvement
- Various bugfixes
- Default detection resolution has been dialed back to 1536 from 2048
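
The Python >= 3.8 requirement noted for this release follows from functools.cached_property, which was added to the standard library in Python 3.8. A minimal sketch of what the decorator does, using a hypothetical Page class that is not taken from the project:

```python
from functools import cached_property  # added to functools in Python 3.8


class Page:
    """Hypothetical example class, not part of the project."""

    def __init__(self, width, height):
        self.width = width
        self.height = height
        self.calls = 0  # counts how often the property body actually runs

    @cached_property
    def area(self):
        # Runs once per instance; the result is then stored on the instance.
        self.calls += 1
        return self.width * self.height


if __name__ == "__main__":
    page = Page(4, 3)
    print(page.area, page.area, page.calls)
```

On older interpreters the import itself fails, which is why the version floor applies to the whole program.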
2021-07-09
- Fix erroneous image rendering when inpainting is not used
2021-06-18
- Support manual translation
- Support detection and rendering of angled texts
2021-06-13
- Text mask completion is now based on CRF; mask quality is drastically improved
2021-06-10
- Improve text rendering
2021-06-09
- New text-region-based method for detecting text direction
- Support running demo as web service
2021-05-20
- Text detection model is now based on DBNet with ResNet34 backbone
- OCR model is now trained with more English sentences
- Inpaint model is now based on AOT which requires far less memory
- Default inpainting resolution is now increased to 2048, thanks to the new inpainting model
- Support merging hyphenated English words
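
As a rough picture of what merging hyphenated English words involves, the sketch below joins OCR text lines and fuses a word that was split by a trailing hyphen. It is an assumption about the general idea, not the project's code; the function name merge_hyphenated is hypothetical.

```python
def merge_hyphenated(lines):
    """Join text lines, fusing words split by a trailing hyphen.

    Illustrative helper only; not taken from the project.
    """
    merged = ""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip empty lines
        if merged.endswith("-"):
            # Drop the hyphen and glue the continuation directly on.
            merged = merged[:-1] + line
        elif merged:
            merged += " " + line
        else:
            merged = line
    return merged


if __name__ == "__main__":
    print(merge_hyphenated(["This is impor-", "tant news"]))
```

This naive rule also removes the hyphen from genuine compounds that happen to break at a line end (for example "well-" followed by "known"), so a practical version would need a dictionary check or similar.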
2021-05-11
- Add Youdao translator and set it as the default translator
2021-05-06
- Text detection model is now based on DBNet with ResNet101 backbone
- OCR model is now deeper
- Default detection resolution has been increased to 2048 from 1536
Note: this version is slightly better at handling English text, but worse in every other way
2021-03-04
- Added inpainting model
2021-02-17
- First version launched