|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectde.l3s.boilerpipe.extractors.CommonExtractors
public final class CommonExtractors
Provides quick access to common BoilerpipeExtractors.
| Field Summary | |
|---|---|
static ArticleExtractor |
ARTICLE_EXTRACTOR
Works very well for most types of Article-like HTML. |
static CanolaExtractor |
CANOLA_EXTRACTOR
Trained on krdwrd Canola (different definition of "boilerplate"). |
static DefaultExtractor |
DEFAULT_EXTRACTOR
Usually worse than ArticleExtractor, but simpler/no heuristics. |
static KeepEverythingExtractor |
KEEP_EVERYTHING_EXTRACTOR
Dummy Extractor; should return the input text. |
| Method Summary |
|---|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final ArticleExtractor ARTICLE_EXTRACTOR
public static final DefaultExtractor DEFAULT_EXTRACTOR
ArticleExtractor, but simpler/no heuristics.
public static final CanolaExtractor CANOLA_EXTRACTOR
public static final KeepEverythingExtractor KEEP_EVERYTHING_EXTRACTOR
BoilerpipeExtractor, or
somewhere else.
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||