Extract clean, structured content from web pages at scale with Valyu’s Contents API - featuring AI-powered data extraction and intelligent summarization
Length | Characters | Best For |
---|---|---|
short | 25,000 | Summaries, key points |
medium | 50,000 | Articles, blog posts |
large | 100,000 | Academic papers, long-form content |
max | Unlimited | Complete document extraction |
Custom integer | 1,000-1,000,000 | Specific requirements |
summary
field accepts four different types of values. Here are examples for each:
false
)true
)string
)object
)summary
field with an object (structured extraction), you can use any valid JSON Schema specification. For detailed information about available types, formats, and validation rules, see the JSON Schema Type Reference.
Key limitations:
string
- Text data with optional format validationnumber
/ integer
- Numeric values with optional min/maxboolean
- True/false valuesarray
- Lists of items with optional size limitsobject
- Nested structures with propertiesextract_effort: "high"
provides better content quality for complex websites
but takes longer to process.Field | Description |
---|---|
content | Extracted content in markdown format |
summary | AI processing result (text or structured data) |
data_type | "unstructured" (no AI) or "structured" (with AI) |
summary_success | Whether AI processing succeeded |
price | Cost for processing this specific URL |
false
: No AI processing (fastest, cheapest)true
: Basic summarization for general overviews"string"
: Custom instructions for specific summary needs{object}
: Structured extraction for data processingsummary_success
for AI processing statustotal_cost_dollars
for budget management