The Powerful Multi-modal LLM Family for OCR-free Document Understanding. Modularized Multimodal Large Language Model for Document Understanding.