Serialized Form
mName
String mName
- The name of this attribute.
The part before the equals sign, or the stand-alone attribute.
This will be
null if the attribute is whitespace.
mAssignment
String mAssignment
- The assignment string of the attribute.
The equals sign.
This will be
null if the attribute is a
stand-alone attribute.
mValue
String mValue
- The value of the attribute.
The part after the equals sign.
This will be
null if the attribute is an empty or
stand-alone attribute.
mQuote
char mQuote
- The quote, if any, surrounding the value of the attribute, if any.
This will be zero if there are no quotes around the value.
mFeedback
ParserFeedback mFeedback
- Feedback object.
mLexer
Lexer mLexer
- The html lexer associated with this parser.
mText
Text mText
- The prototypical text node.
mRemark
Remark mRemark
- The prototypical remark node.
mTag
Tag mTag
- The prototypical tag node.
mBlastocyst
Map<K,V> mBlastocyst
- The list of tags to return.
The list is keyed by tag name.
|
Package org.htmlparser.beans |
mTrail
Vector<E> mTrail
- Bread crumb trail of visited URLs.
mCrumb
int mCrumb
- Current position on the bread crumb trail.
mLinkBean
HTMLLinkBean mLinkBean
mForward
JMenuItem mForward
mBack
JMenuItem mBack
mCollapse
JCheckBoxMenuItem mCollapse
mTextField
JTextField mTextField
mSplitPane
JSplitPane mSplitPane
mLinks
JCheckBoxMenuItem mLinks
mStringBean
HTMLTextBean mStringBean
mNobreak
JCheckBoxMenuItem mNobreak
mPropertySupport
PropertyChangeSupport mPropertySupport
- Bound property support.
mParser
Parser mParser
- The parser used to filter.
mFilters
NodeFilter[] mFilters
- The filter set.
mNodes
NodeList mNodes
- The nodes extracted from the URL.
mRecursive
boolean mRecursive
- The recursion behaviour for elements of the filter array.
If
true the filters are applied recursively.
- See Also:
NodeList.extractAllNodesThatMatch(NodeFilter, boolean).
mBean
LinkBean mBean
- The underlying bean that provides our htmlparser specific properties.
mBean
StringBean mBean
- The underlying bean that provides our htmlparser specific properties.
mPropertySupport
PropertyChangeSupport mPropertySupport
- Bound property support.
mLinks
URL[] mLinks
- The strings extracted from the URL.
mParser
Parser mParser
- The parser used to extract strings.
mPropertySupport
PropertyChangeSupport mPropertySupport
- Bound property support.
mParser
Parser mParser
- The parser used to extract strings.
mStrings
String mStrings
- The strings extracted from the URL.
mLinks
boolean mLinks
- If
true the link URLs are embedded in the text output.
mReplaceSpace
boolean mReplaceSpace
- If
true regular space characters are substituted for
non-breaking spaces in the text output.
mCollapse
boolean mCollapse
- If
true sequences of whitespace characters are replaced
with a single space character.
mCollapseState
int mCollapseState
- The state of the collapse processiung state machine.
mBuffer
StringBuffer mBuffer
- The buffer text is stored in while traversing the HTML.
mIsScript
boolean mIsScript
- Set
true when traversing a SCRIPT tag.
mIsPre
boolean mIsPre
- Set
true when traversing a PRE tag.
mIsStyle
boolean mIsStyle
- Set
true when traversing a STYLE tag.
|
Package org.htmlparser.filters |
mPredicates
NodeFilter[] mPredicates
- The predicates that are to be and'ed together;
therule
NodeFilter therule
m
Matcher m
tokentype
int tokentype
token
String token
mAttribute
String mAttribute
- The attribute to check for.
mValue
String mValue
- The value to check for.
mChildFilter
NodeFilter mChildFilter
- The filter to apply to children.
mRecursive
boolean mRecursive
- Performs a recursive search down the node heirarchy if
true.
mParentFilter
NodeFilter mParentFilter
- The filter to apply to the parent.
mRecursive
boolean mRecursive
- Performs a recursive search up the node heirarchy if
true.
mSiblingFilter
NodeFilter mSiblingFilter
- The filter to apply to the sibling.
mNode
Node mNode
- The node to match.
mRegex
Pattern mRegex
- The regular expression to use on the link.
mPattern
String mPattern
- The pattern to search for in the link.
mCaseSensitive
boolean mCaseSensitive
- Flag indicating case sensitive/insensitive search.
mClass
Class<T> mClass
- The class to match.
mPredicate
NodeFilter mPredicate
- The filter to gainsay.
mPredicates
NodeFilter[] mPredicates
- The predicates that are to be or'ed together;
mPatternString
String mPatternString
- The regular expression to search for.
mPattern
Pattern mPattern
- The compiled regular expression to search for.
mStrategy
int mStrategy
- The match strategy.
- See Also:
RegexFilter.RegexFilter(String, int)
mPattern
String mPattern
- The string to search for.
mUpperPattern
String mUpperPattern
- The string to really search for (converted to uppercase if necessary).
mCaseSensitive
boolean mCaseSensitive
- Case sensitive toggle.
If
true strings are compared with case sensitivity.
mLocale
Locale mLocale
- The locale to use converting to uppercase in case insensitive searches.
mName
String mName
- The tag name to match.
mPredicates
NodeFilter[] mPredicates
- The predicates that are to be xor'ed together;
|
Package org.htmlparser.http |
mName
String mName
- The name of the cookie.
mValue
String mValue
- The cookie value.
mComment
String mComment
- Describes the cookie's use.
mDomain
String mDomain
- Domain that sees cookie.
mExpiry
Date mExpiry
- Cookie expires after this date.
mPath
String mPath
- URLs that see the cookie.
mSecure
boolean mSecure
- Use SSL.
mVersion
int mVersion
- If Version=1 it means RFC 2109++ style cookies.
|
Package org.htmlparser.lexer |
mPosition
int mPosition
- This cursor's position.
mPage
Page mPage
- This cursor's page.
readObject
private void readObject(ObjectInputStream in)
throws IOException,
ClassNotFoundException
- Deserialization support.
- Throws:
IOException - If deserialization has a problem.
ClassNotFoundException
writeObject
private void writeObject(ObjectOutputStream out)
throws IOException
- Serialization support.
- Throws:
IOException - If serialization has a problem.
mEncoding
String mEncoding
- The character set in use.
mBuffer
char[] mBuffer
- The characters read so far.
mLevel
int mLevel
- The number of valid bytes in the buffer.
mOffset
int mOffset
- The offset of the next byte returned by read().
mMark
int mMark
- The bookmark.
mPage
Page mPage
- The page lexemes are retrieved from.
mCursor
Cursor mCursor
- The current position on the page.
mFactory
NodeFactory mFactory
- The factory for new nodes.
readObject
private void readObject(ObjectInputStream in)
throws IOException,
ClassNotFoundException
- Deserialize the page.
For details see
writeObject().
- Throws:
IOException - If there is a deserialization problem with
the stream.
ClassNotFoundException - If the deserialized class can't be
located with the current classpath and class loader.
writeObject
private void writeObject(ObjectOutputStream out)
throws IOException
- Serialize the page.
There are two modes to serializing a page based on the connected state.
If connected, the URL and the current offset is saved, while if
disconnected, the underling source is saved.
- Throws:
IOException - If there is a serialization problem.
mUrl
String mUrl
- The URL this page is coming from.
Cached value of
getConnection().toExternalForm() or
setUrl().
mBaseUrl
String mBaseUrl
- The base URL for this page.
mSource
Source mSource
- The source of characters.
mIndex
PageIndex mIndex
- Character positions of the first character in each line.
mPage
Page mPage
- The page this attribute is extracted from.
mNameStart
int mNameStart
- The starting offset of the name within the page.
If negative, the name is considered
null.
mNameEnd
int mNameEnd
- The ending offset of the name within the page.
mValueStart
int mValueStart
- The starting offset of the value within the page.
If negative, the value is considered
null.
mValueEnd
int mValueEnd
- The ending offset of the name within the page.
mIncrement
int mIncrement
- Increment for allocations.
mCount
int mCount
- The number of valid elements.
mIndices
int[] mIndices
- The elements.
mPage
Page mPage
- The page associated with this index.
mString
String mString
- The source of characters.
mOffset
int mOffset
- The current offset into the string.
mEncoding
String mEncoding
- The encoding to report.
Only used by
StringSource.getEncoding().
mMark
int mMark
- The bookmark.
|
Package org.htmlparser.lexerapplications.thumbelina |
mURL
URL mURL
- The URL for the picture.
mImage
Image mImage
- The image for the picture.
mOrigin
Point mOrigin
- The upper left hand corner of the image.
This doesn't change, even if the image is cropped.
For example, if the left half of the image is obscured by another,
the
Rectangle fields x, y,
width and height will change, but the
origin remains the same.
mThumbelina
Thumbelina mThumbelina
- The thumbelina object in use.
mMosaic
TileSet mMosaic
- The display mosaic.
mPreferredSize
Dimension mPreferredSize
- The preferred size of this component.
null initially, caches the results of
calculatePreferredSize ().
mUrls
ArrayList<E> mUrls
- URL's to visit.
mVisited
HashMap<K,V> mVisited
- URL's visited.
mRequested
HashMap<K,V> mRequested
- Images requested.
mTracked
HashMap<K,V> mTracked
- Images being tracked currently.
mThread
Thread mThread
- Background thread.
mActive
boolean mActive
- Activity state.
true means processing URLS, false not.
mSequencer
Sequencer mSequencer
- The picture sequencer.
mPicturePanel
PicturePanel mPicturePanel
- The central area for pictures.
mPropertySupport
PropertyChangeSupport mPropertySupport
- Bound property support.
mCurrentURL
String mCurrentURL
- The URL being currently being examined.
mDiscardCGI
boolean mDiscardCGI
- If
true, does not follow links containing cgi calls.
mDiscardQueries
boolean mDiscardQueries
- If
true, does not follow links containing queries (?).
mBackgroundToggle
JCheckBox mBackgroundToggle
- Background thread checkbox in status bar.
mHistory
JList mHistory
- History list.
mPicturePanelScroller
JScrollPane mPicturePanelScroller
- Scroller for the picture panel.
mHistoryScroller
JScrollPane mHistoryScroller
- Scroller for the history list.
mMainArea
JSplitPane mMainArea
- Main panel in central area.
mPowerBar
JPanel mPowerBar
- Status bar.
mQueueProgress
JProgressBar mQueueProgress
- Image request queue monitor in status bar.
mReadyProgress
JProgressBar mReadyProgress
- Image ready queue monitor in status bar.
mRunToggle
JCheckBox mRunToggle
- Sequencer thread toggle in status bar.
mSpeedSlider
JSlider mSpeedSlider
- Sequencer speed slider in status bar.
mUrlText
JTextField mUrlText
- URL report in status bar.
mQueueSize
JLabel mQueueSize
- URL queue size display in status bar.
mVisitedSize
JLabel mVisitedSize
- URL visited count display in status bar.
mMenu
JMenuBar mMenu
- Main menu.
mURL
JMenu mURL
- URL submenu.
mOpen
JMenuItem mOpen
- Open menu item.
mGoogle
JMenuItem mGoogle
- Google menu item.
mSeparator1
JSeparator mSeparator1
- MRU list separator #1.
mSeparator2
JSeparator mSeparator2
- MRU list separator #2.
mExit
JMenuItem mExit
- Exit menu item.
mView
JMenu mView
- View submenu.
mStatusVisible
JCheckBoxMenuItem mStatusVisible
- Status bar visible menu item.
mHistoryVisible
JCheckBoxMenuItem mHistoryVisible
- History list visible menu item.
mCommand
JMenu mCommand
- Vommand menu.
mReset
JMenuItem mReset
- Reset menu item.
mClear
JMenuItem mClear
- Clear menu item
mHelp
JMenu mHelp
- Help submenu.
mAbout
JMenuItem mAbout
- About menu item.
|
Package org.htmlparser.nodes |
mPage
Page mPage
- The page this node came from.
nodeBegin
int nodeBegin
- The beginning position of the tag in the line
nodeEnd
int nodeEnd
- The ending position of the tag in the line
parent
Node parent
- The parent of this node.
children
NodeList children
- The children of this node.
mText
String mText
- The contents of the remark node, or override text.
mScanner
Scanner mScanner
- The scanner for this tag.
mAttributes
Vector<E> mAttributes
- The tag attributes.
Objects of type
Attribute.
The first element is the tag name, subsequent elements being either
whitespace or real attributes.
mText
String mText
- The contents of the string node, or override text.
|
Package org.htmlparser.parserapplications.filterbuilder |
mBasePoint
Point mBasePoint
- The relative position of the mouse while dragging.
mSelection
Vector<E> mSelection
- Selected commands.
mMoved
boolean mMoved
- If true selection moved.
mDropTarget
DropTarget mDropTarget
- This component is a drop target.
mDragSource
DragSource mDragSource
- Enables this component to be a Drag Source.
mCurrentComponent
Component mCurrentComponent
- Kludge: Used by actionPerformed/filterAction to remember the filter menu item.
mMainPanel
JPanel mMainPanel
- The main panel GUI component.
mMainScroller
JScrollPane mMainScroller
- The main panel scrolling GUI component.
mURLField
JTextField mURLField
- The URL input GUI component.
mOutput
JDesktopPane mOutput
- The output panel GUI component.
mExtra
int mExtra
- Padding for the drop target.
mSpacer
Component mSpacer
- The drop target spacer at the bottom of the list.
mHome
Filter mHome
- The parent filter wrapper.
mTitle
String mTitle
- The textual title for the list.
mMax
int mMax
- The number of filters limit.
|
Package org.htmlparser.parserapplications.filterbuilder.layouts |
|
Package org.htmlparser.parserapplications.filterbuilder.wrappers |
mContainer
SubFilterList mContainer
- The drop target container.
mFilter
AndFilter mFilter
- The underlying filter.
mFilter
HasAttributeFilter mFilter
- The underlying filter.
mAttributeName
JComboBox mAttributeName
- Combo box for attribute name.
mValued
JCheckBox mValued
- The check box for has value.
mAttributeValue
JTextArea mAttributeValue
- Value to check for.
mFilter
HasChildFilter mFilter
- The underlying filter.
mRecursive
JCheckBox mRecursive
- The check box for recursion.
mContainer
SubFilterList mContainer
- The drop target container.
mFilter
HasParentFilter mFilter
- The underlying filter.
mRecursive
JCheckBox mRecursive
- The check box for recursion.
mContainer
SubFilterList mContainer
- The drop target container.
mFilter
HasSiblingFilter mFilter
- The underlying filter.
mContainer
SubFilterList mContainer
- The drop target container.
mFilter
NodeClassFilter mFilter
- The underlying filter.
mClass
JComboBox mClass
- Combo box for strategy.
mContainer
SubFilterList mContainer
- The drop target container.
mFilter
NotFilter mFilter
- The underlying filter.
mContainer
SubFilterList mContainer
- The drop target container.
mFilter
OrFilter mFilter
- The underlying filter.
mFilter
RegexFilter mFilter
- The underlying filter.
mPattern
JTextArea mPattern
- Text to check for
mStrategy
JComboBox mStrategy
- Combo box for strategy.
mFilter
StringFilter mFilter
- The underlying filter.
mPattern
JTextArea mPattern
- Text to check for.
mCaseSensitivity
JCheckBox mCaseSensitivity
- The check box for case sensitivity.
mLocale
JComboBox mLocale
- Combo box for locale.
mFilter
TagNameFilter mFilter
- The underlying filter.
mName
JComboBox mName
- Combo box for tag name.
|
Package org.htmlparser.scanners |
|
Package org.htmlparser.tags |
mEndTag
Tag mEndTag
- The tag that causes this tag to finish.
May be a virtual tag generated by the scanning logic.
mFormLocation
String mFormLocation
- This is the derived form location, based on action.
imageURL
String imageURL
- Holds the set value of the SRC attribute, since this can differ
from the attribute value due to relative references resolved by
the scanner.
mLink
String mLink
- The URL where the link points to
mailLink
boolean mailLink
- Set to true when the link was a mailto: URL.
javascriptLink
boolean javascriptLink
- Set to true when the link was a javascript: URL.
mCode
String mCode
- Script code if different from the page contents.