Skip to content

VLM4OCR Documentation

Home

VLM4OCR Documentation

Home
Web Application
CLI
Python package
Python package
API Reference
API Reference

VLM4OCR

vlm4ocr is a toolkit for Optical character recognition (OCR) with Vision language models (VLMs). In includes three components:

Web Application for drag-and-drop access
CLI for command line access
Python package for Python access