A Python script that extracts text from PDF documents using Google Cloud Vision API and converts tables to Markdown format. Optimized for French documents with ~95% accuracy.
- ✨ High Accuracy OCR: Leverages Google Cloud Vision API for superior text recognition
- 📊 Table Detection: Automatically detects and converts tables to Markdown format
- 🇫🇷 Language Optimized: Configured for French documents (easily customizable)
- 📄 Multi-page Support: Handles PDFs of any size