VLM4OCR
vlm4ocr is a toolkit for Optical character recognition (OCR) with Vision language models (VLMs). In includes three components:
- Web Application for drag-and-drop access
- CLI for command line access
- Python package for Python access
vlm4ocr is a toolkit for Optical character recognition (OCR) with Vision language models (VLMs). In includes three components: