Changelog

2023-11-11

Added new OCR model: 48px

2023-05-08

Added 4x-UltraSharp upscaler

2023-04-30

Countless bug fixes and refactoring
Added CRAFT detector; enable it with --detector craft

2022-06-15

Added new inpainting model LaMa MPE by dmMaze and set it as the default

2022-04-23

Project version is now at beta-0.3

Added English text renderer by dmMaze
Added new CTC-based OCR engine with a significant speed improvement
The new OCR model now supports Korean

2022-03-19

Use new font rendering method by pokedexter
Added manual translation UI by rspreet92

2022-01-24

Added text detection model by dmMaze

2021-08-21

New MST-based text region merge algorithm, which greatly improves text region merging
Added Baidu translator in demo mode
Added Google translator in demo mode
Various bugfixes
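
The entry above does not describe how the minimum spanning tree (MST) is used, so the following is only a minimal sketch of one common approach, not the project's actual algorithm: build an MST over the centers of detected text boxes, cut every MST edge longer than a threshold, and treat each remaining connected component as one merged region. The function name `merge_regions_by_mst`, the use of box centers, and the `max_gap` parameter are all assumptions made for illustration.

```python
import math


def merge_regions_by_mst(centers, max_gap):
    """Group points by building an MST and cutting edges longer than max_gap.

    Hypothetical helper for illustration only.
    Returns a sorted list of groups, each a list of indices into `centers`.
    """
    n = len(centers)
    # All pairwise edges, sorted by Euclidean distance (input to Kruskal's algorithm).
    edges = sorted(
        (math.dist(centers[i], centers[j]), i, j)
        for i in range(n)
        for j in range(i + 1, n)
    )

    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # First pass: Kruskal's algorithm collects the MST edges.
    mst_edges = []
    for dist, i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i != root_j:
            parent[root_i] = root_j
            mst_edges.append((dist, i, j))

    # Second pass: keep only short MST edges; each component is one merged region.
    parent = list(range(n))
    for dist, i, j in mst_edges:
        if dist <= max_gap:
            parent[find(i)] = find(j)

    groups = {}
    for idx in range(n):
        groups.setdefault(find(idx), []).append(idx)
    return sorted(groups.values())


# Three boxes close together on the left, two close together on the right.
example_centers = [(0, 0), (10, 0), (20, 0), (200, 0), (210, 0)]
example_groups = merge_regions_by_mst(example_centers, max_gap=15)
```

With these example centers, the single long MST edge that bridges the two clusters is cut, leaving two groups. A real merge step would likely also weigh font size, orientation, and box geometry rather than center distance alone.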

2021-07-29

Web demo adds translator, detection resolution, and target language options
Slight text color extraction improvement

2021-07-26

Major upgrades for all components; we are now in beta!
Note: in this version, all English text is detected as capital letters.
You need Python >= 3.8 for cached_property to work.
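
The version requirement comes from the standard library: `functools.cached_property` was added in Python 3.8. As a minimal sketch of what the decorator does, the `TextRegion` class and its attributes below are hypothetical and are not taken from the project:

```python
import sys
from functools import cached_property

# cached_property was added to functools in Python 3.8.
assert sys.version_info >= (3, 8), "Python >= 3.8 is required"


class TextRegion:
    """Hypothetical class, used only to illustrate cached_property."""

    def __init__(self, width, height):
        self.width = width
        self.height = height
        self.computations = 0  # counts how often `area` is actually computed

    @cached_property
    def area(self):
        # Runs once per instance; the result is then stored on the instance.
        self.computations += 1
        return self.width * self.height


region = TextRegion(40, 12)
first = region.area
second = region.area  # served from the cache; `area` is not recomputed
```

On older interpreters the import itself fails with an ImportError, which is the usual symptom of running this version on Python 3.7 or earlier.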

Detection model upgrade
OCR model upgrade, better at text color extraction
Inpainting model upgrade
Major text rendering improvement: faster rendering and higher-quality text with shadow
Slight mask generation improvement
Various bugfixes
Default detection resolution has been dialed back to 1536 from 2048

2021-07-09

Fix erroneous image rendering when inpainting is not used

2021-06-18

Support manual translation
Support detection and rendering of angled text

2021-06-13

Text mask completion is now based on CRF; mask quality is drastically improved

2021-06-10

Improve text rendering

2021-06-09

New text region based text direction detection method
Support running demo as web service

2021-05-20

Text detection model is now based on DBNet with ResNet34 backbone
OCR model is now trained with more English sentences
Inpainting model is now based on AOT, which requires far less memory
Default inpainting resolution is now increased to 2048, thanks to the new inpainting model
Support merging hyphenated English words
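
The last entry above, merging hyphenated English words, can be illustrated with a small sketch. It is an assumption about the general idea, not the project's implementation: when a recognized line ends with a hyphen and the next line starts with a lowercase letter, the two halves are joined. The function name `merge_hyphenated` is hypothetical.

```python
def merge_hyphenated(lines):
    """Join a line ending in '-' with the next line when that line starts lowercase.

    Hypothetical helper for illustration only.
    """
    merged = []
    for line in lines:
        text = line.strip()
        if merged and merged[-1].endswith("-") and text[:1].islower():
            # Drop the trailing hyphen and glue the two halves together.
            merged[-1] = merged[-1][:-1] + text
        else:
            merged.append(text)
    return merged


example_lines = ["This is impor-", "tant news", "for every-", "one"]
example_merged = merge_hyphenated(example_lines)
```

This simple rule would also join genuine compounds that happen to break at the hyphen (for example "well-" followed by "known"), so a production version would probably need a dictionary check or similar.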

2021-05-11

Added Youdao translator and set it as the default translator

2021-05-06

Text detection model is now based on DBNet with ResNet101 backbone
OCR model is now deeper
Default detection resolution has been increased to 2048 from 1536

Note: this version is slightly better at handling English text; other than that, it is worse in every other way

2021-03-04

Added inpainting model

2021-02-17

First version launched