java.lang.Object
  de.l3s.boilerpipe.sax.CommonTagActions
public abstract class CommonTagActions
Defines an action that is to be performed whenever a particular tag occurs during HTML parsing.
| Nested Class Summary | |
|---|---|
| static class | CommonTagActions.BlockTagLabelAction: CommonTagActions for block-level elements, which triggers some LabelAction on the generated TextBlock. |
| static class | CommonTagActions.Chained |
| static class | CommonTagActions.InlineTagLabelAction: CommonTagActions for inline elements, which triggers some LabelAction on the generated TextBlock. |
| Field Summary | |
|---|---|
| static TagAction | TA_ANCHOR_TEXT: Marks this tag as "anchor" (this should usually only be set for the <A> tag). |
| static TagAction | TA_BODY: Marks this tag as the body element (this should usually only be set for the <BODY> tag). |
| static TagAction | TA_FONT: Special TagAction for the <FONT> tag, which keeps track of the absolute and relative font size. |
| static TagAction | TA_IGNORABLE_ELEMENT: Marks this tag as "ignorable", i.e. all its inner content is silently skipped. |
| static TagAction | TA_INLINE: Deprecated. Use TA_INLINE_WHITESPACE instead. |
| static TagAction | TA_INLINE_NO_WHITESPACE: Marks this tag as a simple "inline" element, which generates neither whitespace nor a new block. |
| static TagAction | TA_INLINE_WHITESPACE: Marks this tag as a simple "inline" element, which generates whitespace, but no new block. |
| Method Summary |
|---|
| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static final TagAction TA_IGNORABLE_ELEMENT
Marks this tag as "ignorable", i.e. all its inner content is silently skipped.
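Silently skipping the inner content of an ignorable element can be pictured as a depth counter that suppresses text while it is above zero. The sketch below is only an illustration of that idea and is not boilerpipe's implementation: the class `IgnorableElementSketch`, its `extract` method, and the `"+"` / `"-"` event encoding are all hypothetical.

```java
/** Hypothetical model of "ignorable" elements; not part of boilerpipe. */
final class IgnorableElementSketch {

    /**
     * Processes a flat event stream: "+" opens an ignorable element,
     * "-" closes one, and any other string is a text fragment.
     * Text is kept only while no ignorable element is open.
     */
    static String extract(String... events) {
        StringBuilder out = new StringBuilder();
        int ignorableDepth = 0; // number of currently open ignorable elements
        for (String event : events) {
            if (event.equals("+")) {
                ignorableDepth++;
            } else if (event.equals("-")) {
                if (ignorableDepth > 0) {
                    ignorableDepth--;
                }
            } else if (ignorableDepth == 0) {
                out.append(event);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // "b", "c" and "d" sit inside (nested) ignorable elements and are dropped.
        System.out.println(extract("a", "+", "b", "+", "c", "-", "d", "-", "e")); // prints "ae"
    }
}
```

A counter rather than a boolean flag is used here so that ignorable elements nested inside each other do not re-enable output too early.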
public static final TagAction TA_ANCHOR_TEXT
Marks this tag as "anchor" (this should usually only be set for the <A> tag).
Anchor tags may not be nested.
A bug in certain versions of NekoHTML nevertheless allows nested anchor tags.
If boilerpipe encounters such nesting, a SAXException is thrown.
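The nesting rule above can be illustrated with a small state machine that rejects an opening anchor while another one is still open. This is a minimal sketch under stated assumptions, not boilerpipe's actual code: `AnchorNestingSketch`, `startAnchor`, `endAnchor`, `accepts`, and the `'<'` / `'>'` event encoding are hypothetical names chosen for the example; only `org.xml.sax.SAXException` is a real JDK class.

```java
import org.xml.sax.SAXException;

/** Hypothetical stand-in for an anchor tag action; not part of boilerpipe. */
final class AnchorNestingSketch {
    private int anchorDepth = 0; // number of currently open <A> elements

    /** Called for an opening <A> tag; rejects nesting. */
    void startAnchor() throws SAXException {
        if (anchorDepth > 0) {
            throw new SAXException("Nested <A> elements are not supported");
        }
        anchorDepth++;
    }

    /** Called for the matching closing </A> tag. */
    void endAnchor() {
        if (anchorDepth > 0) {
            anchorDepth--;
        }
    }

    /** Replays events ('<' opens an anchor, '>' closes one); false if nesting was detected. */
    static boolean accepts(String events) {
        AnchorNestingSketch sketch = new AnchorNestingSketch();
        try {
            for (char c : events.toCharArray()) {
                if (c == '<') {
                    sketch.startAnchor();
                } else if (c == '>') {
                    sketch.endAnchor();
                }
            }
            return true;
        } catch (SAXException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(accepts("<><>")); // sequential anchors: prints true
        System.out.println(accepts("<<>>")); // nested anchors: prints false
    }
}
```

Callers that parse untrusted HTML may want to catch the SAXException and decide whether to skip the document or retry with a different parser configuration.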
public static final TagAction TA_BODY
Marks this tag as the body element (this should usually only be set for the <BODY> tag).
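public static final TagAction TA_INLINE_WHITESPACE
Marks this tag as a simple "inline" element, which generates whitespace, but no new block.
@Deprecated public static final TagAction TA_INLINE
Deprecated. Use TA_INLINE_WHITESPACE instead.
public static final TagAction TA_INLINE_NO_WHITESPACE
Marks this tag as a simple "inline" element, which generates neither whitespace nor a new block.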
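The difference between the two inline variants shows up in the extracted text: one separates the element's content from its surroundings with whitespace, the other joins it directly. The following sketch models that behavior on a plain string buffer. It assumes whitespace is added at both the start and end tag and only when the buffer does not already end in whitespace; the class `InlineWhitespaceSketch` and its methods are hypothetical and do not reproduce boilerpipe's internals.

```java
/** Hypothetical model of inline-tag whitespace handling; not boilerpipe code. */
final class InlineWhitespaceSketch {
    private final StringBuilder text = new StringBuilder();

    /** Appends one space unless the buffer is empty or already ends in whitespace. */
    private void addWhitespaceIfNecessary() {
        int length = text.length();
        if (length > 0 && !Character.isWhitespace(text.charAt(length - 1))) {
            text.append(' ');
        }
    }

    /** Renders before + inline element content + after under the chosen policy. */
    static String render(boolean generatesWhitespace, String before, String inside, String after) {
        InlineWhitespaceSketch sketch = new InlineWhitespaceSketch();
        sketch.text.append(before);
        if (generatesWhitespace) {
            sketch.addWhitespaceIfNecessary(); // at the start tag
        }
        sketch.text.append(inside);
        if (generatesWhitespace) {
            sketch.addWhitespaceIfNecessary(); // at the end tag
        }
        sketch.text.append(after);
        return sketch.text.toString();
    }

    public static void main(String[] args) {
        System.out.println(render(true, "foo", "bar", "baz"));  // prints "foo bar baz"
        System.out.println(render(false, "foo", "bar", "baz")); // prints "foobarbaz"
    }
}
```

In both cases the text stays in a single run, which mirrors the "no new block" part of the field descriptions.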
public static final TagAction TA_FONT
Special TagAction for the <FONT> tag, which keeps track of the absolute and relative font size.