Skip to content

Latest commit

 

History

History
43 lines (34 loc) · 2.67 KB

File metadata and controls

43 lines (34 loc) · 2.67 KB

ChunkMetadataOutput

Metadata for a chunk including source document references.

Properties

Name Type Description Notes
polygons List[PolygonReference] List of bounding boxes in the source document for the chunk, potentially from multiple areas of multiple pages. [optional]
s3_urls List[str] Ordered s3:// URIs to visual assets for this chunk. Single-element for standard IMAGE/TABLE/HTML chunks, multi-element for multi-page single-chunk ingestion. [optional]
summary str LLM-generated summary of the chunk content. Used for TABLE and HTML chunks to enrich embedding text. [optional]
extracted_text_s3_uri str S3 URI to extracted PDF text used for LLM grounding during enrichment [optional]
secondary_taxonomy ImageTaxonomy [optional]
sheet_name str Worksheet name this chunk was extracted from (XLSX only) [optional]
block_type str XLSXParser block type (e.g. table, calculation_block, chart_anchor) [optional]
source_uri str Cell range URI reference in the source workbook (XLSX only) [optional]
enriched_html str Rendered HTML for non-table XLSX chunks (tables use render_html as content) [optional]
cell_range str Cell address range, e.g. 'A1:D10' (XLSX only) [optional]
dependency_summary Dict[str, object] Upstream/downstream/cross-sheet cell references for audit reasoning (XLSX only) [optional]
formulas List[Dict[str, str]] Formula cells in this chunk as [{address, formula, value}] (XLSX only) [optional]
key_cells List[Dict[str, object]] Notable output/header cells for quick identification (XLSX only) [optional]
named_ranges List[Dict[str, object]] Named ranges overlapping this chunk (XLSX only) [optional]

Example

from ksapi.models.chunk_metadata_output import ChunkMetadataOutput

# TODO update the JSON string below
json = "{}"
# create an instance of ChunkMetadataOutput from a JSON string
chunk_metadata_output_instance = ChunkMetadataOutput.from_json(json)
# print the JSON string representation of the object
print(ChunkMetadataOutput.to_json())

# convert the object into a dict
chunk_metadata_output_dict = chunk_metadata_output_instance.to_dict()
# create an instance of ChunkMetadataOutput from a dict
chunk_metadata_output_from_dict = ChunkMetadataOutput.from_dict(chunk_metadata_output_dict)

[Back to Model list] [Back to API list] [Back to README]