Thank you to arXiv for use of its open access interoperability.
-
Retrieve and store descriptive metadata (titles, abstracts, authors, identifiers, classifications)
-
Search and discover papers through API
-
Retrieve paper content for research purposes
-
Maximum 1 request every 3 seconds for legacy APIs (including the main ArXiv API)
-
Single connection at a time
- Advanced Search: Query arXiv papers with field-specific searches (
ti:,au:,abs:, etc.) - Boolean Logic: Support for
AND,OR,ANDNOToperators - Flexible Sorting: Sort by relevance, submission date, or last updated
- ID-based Lookup: Direct paper retrieval by arXiv ID
- PDF Download: Download papers directly by arXiv ID
- Flexible Paths: Save to custom locations or use auto-generated filenames
- Progress Tracking: File size reporting and download status
- Error Handling: Robust validation and error reporting
- HTML Access: Fetch arXiv papers in HTML format for analysis
- Direct URL Support: Access papers via arXiv HTML interface
- Full Content: Get complete paper content including text and structure
- Research Integration: Perfect for AI agents and research tools
- Universal Image Support: Download images from any URL (PNG, JPEG, GIF, WebP)
- AI Agent Compatible: Returns file paths for vision model analysis
- Smart Storage: Automatic file naming with conflict resolution
- Format Detection: Automatic image format detection from URL/headers
- Research Figures: Perfect for downloading and analyzing paper figures
- Query Parameter Validation
- Required parameters: Ensure either
search_queryorid_listis provided - Parameter combinations: Validate logic between
search_queryandid_list - Unknown parameters: Reject invalid parameter names
- Field Prefix Validation
- Valid prefixes:
ti,au,abs,co,jr,cat,rn,id,all - Boolean operators:
AND,OR,ANDNOT - Syntax checking: Proper field:value format
- Parameter Type Validation
start: Non-negative integermax_results: Positive integer (1-2000 limit)sortBy: Must be "relevance", "lastUpdatedDate", or "submittedDate"sortOrder: Must be "ascending" or "descending"
- ID List Validation
- Old format:
math.GT/0309136v1(subject-class/YYMMnnn) - New format:
2301.00001v1(YYMM.NNNN) - Version numbers: Optional vN suffix
- Comma separation: Multiple IDs properly formatted
- Date Range Validation (from your API spec)
submittedDateformat: [YYYYMMDDTTTT+TO+YYYYMMDDTTTT]- Date logic: Start date before end date
- GMT time format: TTTT in 24-hour format
- URL Encoding Validation
- Special characters: Proper encoding of spaces (+), quotes (%22), parentheses (%28, %29) in
search_query - Reserved characters: Ensure proper escaping in
search_query
Integrate with AI Agents
{
"mcpServers": {
"papers": {
"command": "uv",
"args": [
"--directory",
"/home/locch/Works/mcp-server-papers",
"run",
"mcp_server_papers"
]
}
}
}Or:
{
"mcpServers": {
"papers": {
"serverUrl": "http://localhost:8000/mcp"
}
}
}