Overview
Paper2Any is an open-source multimodal workflow platform for academic papers. It converts paper PDFs, screenshots, or text into model architecture diagrams, technical roadmaps, experimental plots, PPT presentations, and more — all with one click.Project Info
- 🔗 Source Code:
github.com/OpenDCAI/Paper2Any - 📜 License: Open Source
- 👤 Organization: OpenDCAI
- ⭐ Community contributed, supports multiple LLMs via APIYI
Why Paper2Any
Multiple Output Formats
Convert papers to architecture diagrams, roadmaps, PPTs, rebuttals, and more — one tool for the entire research workflow
Flexible Model Selection
Dynamically switch between GPT-4o, Claude Sonnet, Qwen-VL and more via API parameters — no hardcoding needed
CLI + Web Dual Mode
Both command-line scripts and web interface available to suit different workflows
OpenAI-Compatible API
Native support for OpenAI-compatible API format — just configure APIYI’s Base URL to access 200+ models
Core Modules
| Module | Description | Output Formats |
|---|---|---|
| Paper2Figure | Generate scientific visualizations from papers | Architecture diagrams, roadmaps (PPTX + SVG), plots |
| Paper2Diagram | Create flowcharts from papers/text/images | draw.io / PNG / SVG |
| Paper2PPT | Convert papers to presentations | PPTX (supports 40+ slides) |
| Paper2Rebuttal | Generate structured rebuttal responses | Rebuttal docs with evidence grounding |
| PDF2PPT | Layout-preserving PDF to editable PPT | PPTX |
| Image2PPT | Transform images/screenshots into slides | PPTX |
| PPTPolish | AI-driven layout optimization | PPTX |
| Knowledge Base | File ingestion, semantic search, KB-driven generation | Multiple formats |
Connect to LLMs via APIYI
Paper2Any supports OpenAI-compatible API format. After configuring APIYI as the LLM endpoint, you can use GPT, Claude, Gemini, DeepSeek, and 200+ other models.Docker Deployment
Step 1: Get Your APIYI API Key
- Visit APIYI Console to register/login
- Go to the Tokens section
- Generate a new API key
- Copy the key (starts with
sk-)
Step 2: Clone and Configure Backend
After cloning the repository, edit Optionally, specify default models for different workflows:
fastapi_app/.env to set APIYI as the LLM endpoint:CLI Usage
Paper2Any provides standalone CLI scripts with--api-url and --api-key parameters for direct APIYI integration:
Deployment Options
| Method | Requirements | Best For |
|---|---|---|
| Docker (Recommended) | One-click frontend + backend startup | Quick start, production |
| Linux Native | Python 3.11+, LaTeX, Inkscape, LibreOffice | Development, customization |
| Windows | Python 3.12, Inkscape | Local use |
FAQ
How do I connect Paper2Any to APIYI models?
How do I connect Paper2Any to APIYI models?
Set
DEFAULT_LLM_API_URL to https://api.apiyi.com/v1 and BACKEND_API_KEY to your APIYI key in the environment variables. For CLI mode, use the --api-url and --api-key parameters.Which models are supported?
Which models are supported?
Via APIYI, you can access 200+ models including GPT-4o, Claude Sonnet 4.5, Gemini, DeepSeek, Qwen, and more. Models can be dynamically switched in the web interface without code changes.
Docker startup fails — what should I check?
Docker startup fails — what should I check?
Verify that:
- Docker and Docker Compose are properly installed
.envfiles are correctly configured- Required ports are not in use
- Check
docker compose logsfor detailed error messages
PPT generation errors or incomplete content?
PPT generation errors or incomplete content?
- Ensure your APIYI account has sufficient balance
- For long papers, use models with larger context windows (e.g., GPT-4o 128K)
- Check that the paper PDF is searchable text (scanned PDFs may produce poor results)
How do I get an APIYI API key?
How do I get an APIYI API key?
Visit the APIYI Console, create an account, and generate a new key in the Tokens section. New users receive free trial credits.
Related Resources
APIYI Model List
View the complete list of 200+ models supported by APIYI
Base URL Configuration
Learn how to configure APIYI Base URL in various tools
APIYI Token Management
Manage API keys, check usage and balance
APIYI Pricing
View model pricing and top-up offers