java.lang.Object
  de.l3s.boilerpipe.extractors.ExtractorBase
    de.l3s.boilerpipe.extractors.LargestContentExtractor
public final class LargestContentExtractor extends ExtractorBase
A full-text extractor that extracts the largest text component of a page.
For news articles, it may perform better than DefaultExtractor,
but it usually performs worse than ArticleExtractor.
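To make "largest text component" concrete, here is a minimal, self-contained sketch of one plausible reading of the idea: among the blocks flagged as content, keep only the one with the most words. It is not boilerpipe's implementation. The `Block` class, `keepLargest`, and `contentText` are hypothetical names invented for this illustration. The boolean return value mirrors the documented contract of `process`, which reports whether the document was changed.

```java
import java.util.ArrayList;
import java.util.List;

// Simplified stand-in for illustration; NOT boilerpipe's classes or algorithm.
final class LargestBlockSketch {

    /** Hypothetical text block: some text plus a content/boilerplate flag. */
    static final class Block {
        final String text;
        boolean isContent;

        Block(String text, boolean isContent) {
            this.text = text;
            this.isContent = isContent;
        }

        /** Counts whitespace-separated words; empty text counts as zero. */
        int wordCount() {
            String trimmed = text.trim();
            return trimmed.isEmpty() ? 0 : trimmed.split("\\s+").length;
        }
    }

    /**
     * Keeps only the content block with the most words and demotes every
     * other content block. Returns true if any flag was changed.
     */
    static boolean keepLargest(List<Block> blocks) {
        Block largest = null;
        for (Block b : blocks) {
            if (b.isContent && (largest == null || b.wordCount() > largest.wordCount())) {
                largest = b;
            }
        }
        boolean changed = false;
        for (Block b : blocks) {
            if (b.isContent && b != largest) {
                b.isContent = false;
                changed = true;
            }
        }
        return changed;
    }

    /** Joins the text of all blocks still flagged as content. */
    static String contentText(List<Block> blocks) {
        StringBuilder sb = new StringBuilder();
        for (Block b : blocks) {
            if (b.isContent) {
                if (sb.length() > 0) {
                    sb.append('\n');
                }
                sb.append(b.text);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Block> blocks = new ArrayList<>();
        blocks.add(new Block("Home | News | Contact", false));
        blocks.add(new Block("A short teaser line.", true));
        blocks.add(new Block("The main article body, which has clearly more words than the teaser.", true));

        boolean changed = keepLargest(blocks);
        System.out.println(changed);
        System.out.println(contentText(blocks));
    }
}
```

With the real library on the classpath, the corresponding call would presumably go through one of the inherited `getText` overloads listed below, for example on `LargestContentExtractor.INSTANCE`. The exact overload signatures are not shown on this page.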
| Field Summary | |
|---|---|
| static LargestContentExtractor | INSTANCE |
| Method Summary | |
|---|---|
| static LargestContentExtractor | getInstance() - Returns the singleton instance for LargestContentExtractor. |
| boolean | process(TextDocument doc) - Processes the given document doc. |
| Methods inherited from class de.l3s.boilerpipe.extractors.ExtractorBase |
|---|
| getText, getText, getText, getText, getText |

| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final LargestContentExtractor INSTANCE
| Method Detail |
|---|
public static LargestContentExtractor getInstance()
Returns the singleton instance for LargestContentExtractor.
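The combination of a `public static final INSTANCE` field and a `getInstance()` accessor follows the usual eager-singleton shape. The sketch below illustrates that shape with a stand-in class. `SingletonExtractorSketch` is a hypothetical name, and the private constructor is an assumption, since this page lists no constructors.

```java
// Stand-in class showing the documented shape; not the library's source code.
final class SingletonExtractorSketch {

    /** Single shared instance, created once when the class is initialized. */
    public static final SingletonExtractorSketch INSTANCE = new SingletonExtractorSketch();

    /** Assumed private so that no second instance can be created. */
    private SingletonExtractorSketch() {
    }

    /** Returns the same object as the INSTANCE field. */
    public static SingletonExtractorSketch getInstance() {
        return INSTANCE;
    }

    public static void main(String[] args) {
        // Both access paths yield the identical object reference.
        System.out.println(getInstance() == INSTANCE);
    }
}
```

Under this pattern, callers can use either the field or the accessor interchangeably.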
public boolean process(TextDocument doc)
throws BoilerpipeProcessingException
Processes the given document doc. Specified by: process in interface BoilerpipeFilter.
Parameters: doc - The TextDocument that is to be processed.
Returns: true if changes have been made to the TextDocument.
Throws: BoilerpipeProcessingException