|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
public interface BoilerpipeExtractor
Describes a complete filter pipeline.
| Method Summary | |
|---|---|
java.lang.String |
getText(org.xml.sax.InputSource is)
Extracts text from the HTML code available from the given InputSource. |
java.lang.String |
getText(java.io.Reader r)
Extracts text from the HTML code available from the given Reader. |
java.lang.String |
getText(java.lang.String html)
Extracts text from the HTML code given as a String. |
java.lang.String |
getText(TextDocument doc)
Extracts text from the given TextDocument object. |
| Methods inherited from interface de.l3s.boilerpipe.BoilerpipeFilter |
|---|
process |
| Method Detail |
|---|
java.lang.String getText(java.lang.String html)
throws BoilerpipeProcessingException
html - The HTML code as a String.
BoilerpipeProcessingException
java.lang.String getText(org.xml.sax.InputSource is)
throws BoilerpipeProcessingException
InputSource.
is - The InputSource containing the HTML
BoilerpipeProcessingException
java.lang.String getText(java.io.Reader r)
throws BoilerpipeProcessingException
Reader.
r - The Reader containing the HTML
BoilerpipeProcessingException
java.lang.String getText(TextDocument doc)
throws BoilerpipeProcessingException
TextDocument object.
doc - The TextDocument.
BoilerpipeProcessingException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||