|
HTML Parser Home Page | |||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object org.htmlparser.PrototypicalNodeFactory
public class PrototypicalNodeFactory
A node factory based on the prototype pattern.
This factory uses the prototype pattern to generate new nodes.
These are cloned as needed to form new Text
, Remark
and
Tag
nodes.
Text and remark nodes are generated from prototypes accessed
via the textPrototype
and
remarkPrototype
properties respectively.
Tag nodes are generated as follows:
Prototype tags, in the form of undifferentiated tags, are held in a hash
table. On a request for a tag, the attributes are examined for the name
of the tag to be created. If a prototype of that name has been registered
(exists in the hash table), it is cloned and the clone is given the
characteristics (Attributes
, start and end position)
of the requested tag.
In the case that no tag has been registered under that name,
a generic tag is created from the prototype acessed via the
tagPrototype
property.
The hash table of registered tags can be automatically populated with
all the known tags from the org.htmlparser.tags
package when
the factory is constructed, or it can start out empty and be populated
explicitly.
Here is an example of how to override all text issued from
Text.toPlainTextString()
,
in this case decoding (converting character references),
which illustrates the use of setting the text prototype:
PrototypicalNodeFactory factory = new PrototypicalNodeFactory (); factory.setTextPrototype ( // create a inner class that is a subclass of TextNode new TextNode () { public String toPlainTextString() { String original = super.toPlainTextString (); return (org.htmlparser.util.Translate.decode (original)); } }); Parser parser = new Parser (); parser.setNodeFactory (factory);
Here is an example of using a custom link tag, in this case just printing the URL, which illustrates registering a tag:
class PrintingLinkTag extends LinkTag { public void doSemanticAction () throws ParserException { System.out.println (getLink ()); } } PrototypicalNodeFactory factory = new PrototypicalNodeFactory (); factory.registerTag (new PrintingLinkTag ()); Parser parser = new Parser (); parser.setNodeFactory (factory);
Field Summary | |
---|---|
protected Map |
mBlastocyst
The list of tags to return. |
protected Remark |
mRemark
The prototypical remark node. |
protected Tag |
mTag
The prototypical tag node. |
protected Text |
mText
The prototypical text node. |
Constructor Summary | |
---|---|
PrototypicalNodeFactory()
Create a new factory with all tags registered. |
|
PrototypicalNodeFactory(boolean empty)
Create a new factory. |
|
PrototypicalNodeFactory(Tag tag)
Create a new factory with the given tag as the only registered tag. |
|
PrototypicalNodeFactory(Tag[] tags)
Create a new factory with the given tags registered. |
Method Summary | |
---|---|
void |
clear()
Clean out the registry. |
Remark |
createRemarkNode(Page page,
int start,
int end)
Create a new remark node. |
Text |
createStringNode(Page page,
int start,
int end)
Create a new string node. |
Tag |
createTagNode(Page page,
int start,
int end,
Vector attributes)
Create a new tag node. |
Tag |
get(String id)
Gets a tag from the registry. |
Remark |
getRemarkPrototype()
Get the object that is cloned to generate remark nodes. |
Set |
getTagNames()
Get the list of tag names. |
Tag |
getTagPrototype()
Get the object that is cloned to generate tag nodes. |
Text |
getTextPrototype()
Get the object that is cloned to generate text nodes. |
Tag |
put(String id,
Tag tag)
Adds a tag to the registry. |
void |
registerTag(Tag tag)
Register a tag. |
PrototypicalNodeFactory |
registerTags()
Register all known tags in the tag package. |
Tag |
remove(String id)
Remove a tag from the registry. |
void |
setRemarkPrototype(Remark remark)
Set the object to be used to generate remark nodes. |
void |
setTagPrototype(Tag tag)
Set the object to be used to generate tag nodes. |
void |
setTextPrototype(Text text)
Set the object to be used to generate text nodes. |
void |
unregisterTag(Tag tag)
Unregister a tag. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected Text mText
protected Remark mRemark
protected Tag mTag
protected Map mBlastocyst
Constructor Detail |
---|
public PrototypicalNodeFactory()
PrototypicalNodeFactory(false)
.
public PrototypicalNodeFactory(boolean empty)
empty
- If true
, creates an empty factory,
otherwise create a new factory with all tags registered.public PrototypicalNodeFactory(Tag tag)
tag
- The single tag to register in the otherwise empty factory.public PrototypicalNodeFactory(Tag[] tags)
tags
- The tags to register in the otherwise empty factory.Method Detail |
---|
public Tag put(String id, Tag tag)
id
- The name under which to register the tag.
For proper operation, the id should be uppercase so it
will be matched by a Map lookup.tag
- The tag to be returned from a createTagNode(org.htmlparser.lexer.Page, int, int, java.util.Vector)
call.
null
if none.public Tag get(String id)
id
- The name of the tag to return.
id
name,
or null
if none.public Tag remove(String id)
id
- The name of the tag to remove.
id
,
or null
if none.public void clear()
public Set getTagNames()
public void registerTag(Tag tag)
id
that the
tag has (i.e. all names returned by tag.getIds()
.
For proper operation, the ids are converted to uppercase so they will be matched by a Map lookup.
tag
- The tag to register.public void unregisterTag(Tag tag)
id
the tag has.
The ids are converted to uppercase to undo the operation of registerTag.
tag
- The tag to unregister.public PrototypicalNodeFactory registerTags()
tag package
by
calling registerTag()
.
public Text getTextPrototype()
Text
nodes.setTextPrototype(org.htmlparser.Text)
public void setTextPrototype(Text text)
text
- The prototype for Text
nodes.
If null
the prototype is set to the default
(TextNode
).getTextPrototype()
public Remark getRemarkPrototype()
Remark
nodes.setRemarkPrototype(org.htmlparser.Remark)
public void setRemarkPrototype(Remark remark)
remark
- The prototype for Remark
nodes.
If null
the prototype is set to the default
(RemarkNode
).getRemarkPrototype()
public Tag getTagPrototype()
createTagNode(org.htmlparser.lexer.Page, int, int, java.util.Vector)
when no
specific tag is found in the list of registered tags.
Tag
nodes.setTagPrototype(org.htmlparser.Tag)
public void setTagPrototype(Tag tag)
createTagNode(org.htmlparser.lexer.Page, int, int, java.util.Vector)
when no
specific tag is found in the list of registered tags.
tag
- The prototype for Tag
nodes.
If null
the prototype is set to the default
(TagNode
).getTagPrototype()
public Text createStringNode(Page page, int start, int end)
createStringNode
in interface NodeFactory
page
- The page the node is on.start
- The beginning position of the string.end
- The ending position of the string.
public Remark createRemarkNode(Page page, int start, int end)
createRemarkNode
in interface NodeFactory
page
- The page the node is on.start
- The beginning position of the remark.end
- The ending positiong of the remark.
public Tag createTagNode(Page page, int start, int end, Vector attributes)
createTagNode
in interface NodeFactory
page
- The page the node is on.start
- The beginning position of the tag.end
- The ending positiong of the tag.attributes
- The attributes contained in this tag.
|
© 2006 Derrick Oswald Sep 17, 2006
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
HTML Parser is an open source library released under Common Public License. |