public class EditableTextExtractor extends TextExtractor
getText() to return all
the text that the user can typically edit in a document, including text in cotent.xml, header and
footer in styles.xml, meta data in meta.xml.
This function can be used by search engine, and text analytic operations.
OdfElement,
DefaultElementVisitorTextExtractor.ExtractorStringBuildermTextBuilder, NewLineChar, TabChar| Modifier and Type | Method and Description |
|---|---|
String |
getText()
Return the editable text content as a string
|
static String |
getText(Document doc)
Return the text content of document as String
|
static String |
getText(OdfElement ele)
Return the text content of a element as String
|
static EditableTextExtractor |
newOdfEditableTextExtractor(Document doc)
An instance of EditableTextExtractor will be created to extract the editable text content in
specified document.
|
static EditableTextExtractor |
newOdfEditableTextExtractor(OdfElement element)
An instance of EditableTextExtractor will be created to extract the editable text content of an
ODF element.
|
void |
visit(DrawObjectElement element)
The end users needn't to care of this method, if you don't want to override the text content
handling strategy of draw:object.
|
void |
visit(TableTableElement ele)
The end users needn't to care of this method, if you don't want to override the text content
handling strategy of table:table.
|
void |
visit(TextAElement ele)
The end users needn't to care of this method, if you don't want to override the text content
handling strategy of text:a.
|
void |
visit(TextTrackedChangesElement ele)
The end users needn't to care of this method, if you don't want to override the text content
handling strategy of text:tracked-changes.
|
appendElementText, newOdfTextExtractor, visit, visit, visit, visit, visit, visitvisit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visit, visitpublic static EditableTextExtractor newOdfEditableTextExtractor(Document doc)
doc - the document whose text will be extracted.public static EditableTextExtractor newOdfEditableTextExtractor(OdfElement element)
element - the ODF element whose text will be extracted.public static String getText(OdfElement ele)
ele - the ODF elementpublic static String getText(Document doc)
doc - the documentpublic String getText()
getText in class TextExtractorpublic void visit(DrawObjectElement element)
visit in class DefaultElementVisitorDefaultElementVisitor.visit(org.odftoolkit.odfdom.dom.element.draw.DrawObjectElement)public void visit(TextTrackedChangesElement ele)
visit in class DefaultElementVisitorDefaultElementVisitor.visit(org.odftoolkit.odfdom.dom.element.text.TextTrackedChangesElement)public void visit(TextAElement ele)
visit in class DefaultElementVisitorDefaultElementVisitor.visit(org.odftoolkit.odfdom.dom.element.text.TextAElement)public void visit(TableTableElement ele)
visit in class DefaultElementVisitorDefaultElementVisitor.visit(org.odftoolkit.odfdom.dom.element.text.TextTabElement)Copyright © 2010–2018 Apache Software Foundation; Copyright © 2018–2020 The Document Foundation. All rights reserved.