Metadata Overview
The following metadata elements may be returned depending on the file type, the document structure and parameters used to call the MetaGlance service.
See metadata details for more information.
Content Analysis:
- subject: Keywords in a comma separated list, including single and multi-word phrases. Can also be interpreted as possible tags for the content.
- title: Title of a document. Not available for text.
- language: The two letter ISO 639-1 code.
- readinglevel: The approximate reading level of the text as determined using the Flesch-Kincaid measure. For example: "12" is around a U.S. 12th grade reading level.
- readingtime: Estimated reading time in seconds, based on the length and complexity of the text. Intended as a general measure.
File Properties:
- identifier: Identifier of the resource for which metadata was generated. By default this is the URL given to it, or blank for a text passage. You can also specify an alternative, see metadata details.
- format: File type, for example: "Web page" or "Flash animation."
- mimetype: MIME type, for example "text/html" or "application/x-shockwave-flash."
- mediatype: A broader generalization of the format, usually "Text" or "Image."
- pages: Number of pages in a document.
- size: Size of file in bytes.
Statistical Data:
- wordcount: The total number of words.
- sentencecount: Number of sentences.
- averagewordlength: Average characters per word.
Beta Elements:
These elements may be returned by using a beta key. See metadata details for more information.
- abstract: Section of a scholarly paper labelled "Abstract."
- classification: Broad subject area (e.g. "business" or "education")
- creator: Author or other creator.
- date: Various possible dates associated with a document. (modified, date returned by the server, date published.)
- modewordlength: The most common word length.
- shortdescription: A one or two sentence description auto-generated by MetaGlance.
- syllablecount: Total number of syllables in the text.
- uniquewordcount: Number of unique words in the text.
