last data update: 2011/10/17, 20:34
Website loading time
during the test: 0.13 s
cable connection (average): 0.13 s
DSL connection (average): 0.14 s
modem (average): 0.31 s
HTTP headers
HTTP/1.0 301 Found
Server: Apache
Status: 301 Found
Expires: Wed, 19 Oct 2011 03:34:03 GMT
Date: Tue, 18 Oct 2011 03:34:03 GMT
Location: http://jericho.htmlparser.net/
HTTP/1.1 200 OK
Server: Apache/2.2.3 (CentOS)
Last-Modified: Sat, 05 Mar 2011 05:17:54 GMT
ETag: "4fa-49db5627f6480"
Accept-Ranges: bytes
Cache-Control: max-age=172800
Expires: Thu, 20 Oct 2011 03:34:03 GMT
Content-Type: text/html
Content-Length: 1274
Date: Tue, 18 Oct 2011 03:34:03 GMT
X-Varnish: 1813411376
Age: 0
Via: 1.1 varnish
Connection: close
Information about DNS servers
htmlparser.net | MX | 0 | mail7.zoneedit.com | IN | 7200 |
htmlparser.net | MX | 0 | mail6.zoneedit.com | IN | 7200 |
htmlparser.net | A | 216.98.141.250 | IN | 60 | |
htmlparser.net | A | 69.72.142.98 | IN | 60 | |
htmlparser.net | SOA | ns9.zoneedit.com | soacontact.zoneedit.com | 2011159410 | 2400 360 1209600 300 IN 7200 |
htmlparser.net | NS | ns7.zoneedit.com | IN | 7200 | |
htmlparser.net | NS | ns9.zoneedit.com | IN | 7200 |
Received from the first DNS server
Request to the server "htmlparser.net"
You used the following DNS server:
DNS Name: ns7.zoneedit.com
DNS Server Address: 216.122.7.155#53
DNS server aliases:
HEADER opcode: REQUEST, status: NOERROR, id: 3900
flag: qr aa rd REQUEST: 1, ANSWER: 7, AUTHORITY: 2, ADDITIONAL: 0
REQUEST SECTION:
htmlparser.net. IN ANY
ANSWER SECTION:
htmlparser.net. 7200 IN SOA ns9.zoneedit.com. soacontact.zoneedit.com. 2011159410 2400 360 1209600 300
htmlparser.net. 60 IN A 216.98.141.250
htmlparser.net. 60 IN A 69.72.142.98
htmlparser.net. 7200 IN NS ns7.zoneedit.com.
htmlparser.net. 7200 IN NS ns9.zoneedit.com.
htmlparser.net. 7200 IN MX 0 mail6.zoneedit.com.
htmlparser.net. 7200 IN MX 0 mail7.zoneedit.com.
AUTHORITY SECTION:
htmlparser.net. 7200 IN NS ns7.zoneedit.com.
htmlparser.net. 7200 IN NS ns9.zoneedit.com.
Received 231 bytes from address 216.122.7.155#53 in 36 ms
Received from the second DNS server
Request to the server "htmlparser.net"
You used the following DNS server:
DNS Name: ns9.zoneedit.com
DNS Server Address: 66.240.231.42#53
DNS server aliases:
HEADER opcode: REQUEST, status: NOERROR, id: 23950
flag: qr aa rd REQUEST: 1, ANSWER: 7, AUTHORITY: 2, ADDITIONAL: 0
REQUEST SECTION:
htmlparser.net. IN ANY
ANSWER SECTION:
htmlparser.net. 7200 IN SOA ns9.zoneedit.com. soacontact.zoneedit.com. 2011159410 2400 360 1209600 300
htmlparser.net. 60 IN A 216.98.141.250
htmlparser.net. 60 IN A 69.72.142.98
htmlparser.net. 7200 IN NS ns7.zoneedit.com.
htmlparser.net. 7200 IN NS ns9.zoneedit.com.
htmlparser.net. 7200 IN MX 0 mail6.zoneedit.com.
htmlparser.net. 7200 IN MX 0 mail7.zoneedit.com.
AUTHORITY SECTION:
htmlparser.net. 7200 IN NS ns7.zoneedit.com.
htmlparser.net. 7200 IN NS ns9.zoneedit.com.
Received 231 bytes from address 66.240.231.42#53 in 72 ms
Subdomains (the first 50)
Typos (misspells)
gtmlparser.net btmlparser.net ntmlparser.net jtmlparser.net utmlparser.net ytmlparser.net hrmlparser.net hfmlparser.net hgmlparser.net hymlparser.net h6mlparser.net h5mlparser.net htnlparser.net htklparser.net htjlparser.net htmkparser.net htmpparser.net htmoparser.net htmloarser.net htmllarser.net html-arser.net html0arser.net htmlpzrser.net htmlpsrser.net htmlpwrser.net htmlpqrser.net htmlpaeser.net | htmlpadser.net htmlpafser.net htmlpatser.net htmlpa5ser.net htmlpa4ser.net htmlparaer.net htmlparzer.net htmlparxer.net htmlparder.net htmlpareer.net htmlparwer.net htmlparswr.net htmlparssr.net htmlparsdr.net htmlparsrr.net htmlpars4r.net htmlpars3r.net htmlparsee.net htmlparsed.net htmlparsef.net htmlparset.net htmlparse5.net htmlparse4.net tmlparser.net hmlparser.net htlparser.net htmparser.net | htmlarser.net htmlprser.net htmlpaser.net htmlparer.net htmlparsr.net htmlparse.net thmlparser.net hmtlparser.net htlmparser.net htmplarser.net htmlaprser.net htmlpraser.net htmlpasrer.net htmlparesr.net htmlparsre.net hhtmlparser.net httmlparser.net htmmlparser.net htmllparser.net htmlpparser.net htmlpaarser.net htmlparrser.net htmlparsser.net htmlparseer.net htmlparserr.net |
Location
IP: 216.98.141.250, 69.72.142.98
continent: NA, country: United States (USA), city: Clifton
Website value
rank in the traffic statistics:
There is not enough data to estimate website value.
Basic information
website build using CSS
code weight: 1.24 KB
text per all code ratio: 55 %
title: Jericho HTML Parser
description: Jericho HTML Parser is a java library allowing analysis and manipulation of parts of an HTML document, including server-side tags, while reproducing verbatim any unrecognised or invalid HTML.
keywords: html, parser, java, library, html form, .NET, DotNet
encoding: ASCII
language: en
Website code analysis
one word phrases repeated minimum three times
Phrase | Quantity |
---|---|
the | 85 |
of | 59 |
to | 36 |
and | 34 |
HTML | 30 |
in | 28 |
document | 23 |
is | 20 |
for | 17 |
source | 17 |
Demonstrates | 15 |
The | 15 |
are | 13 |
as | 13 |
be | 12 |
an | 12 |
that | 11 |
with | 11 |
use | 11 |
text | 10 |
or | 9 |
from | 9 |
can | 9 |
by | 9 |
form | 9 |
all | 9 |
but | 8 |
which | 8 |
not | 7 |
class | 7 |
on | 7 |
into | 7 |
functionality | 6 |
Parser | 6 |
other | 6 |
parser | 6 |
it | 6 |
based | 6 |
this | 6 |
without | 5 |
only | 5 |
tags | 5 |
simple | 5 |
how | 5 |
element | 5 |
new | 5 |
easily | 5 |
if | 5 |
library | 5 |
tags, | 5 |
file | 5 |
document. | 4 |
XML | 4 |
very | 4 |
project | 4 |
at | 4 |
each | 4 |
DTD | 4 |
written | 4 |
formatted | 4 |
such | 4 |
files | 4 |
tree | 4 |
segments | 4 |
parsed | 4 |
This | 4 |
Built-in | 4 |
badly | 4 |
server | 4 |
positions | 4 |
called | 4 |
sample | 4 |
also | 4 |
structure. | 3 |
event | 3 |
its | 3 |
allows | 3 |
It | 3 |
HTML. | 3 |
setting | 3 |
following | 3 |
parsers | 3 |
GNU | 3 |
parser. | 3 |
containing | 3 |
common | 3 |
tag | 3 |
Jericho | 3 |
have | 3 |
manipulation | 3 |
number | 3 |
analysis | 3 |
data | 3 |
nodes | 3 |
none | 3 |
document, | 3 |
package | 3 |
search | 3 |
Supports | 3 |
was | 3 |
format | 3 |
their | 3 |
now | 3 |
so | 3 |
fields | 3 |
well | 3 |
one | 3 |
these | 3 |
two word phrases repeated minimum three times
Phrase | Quantity |
---|---|
of the | 20 |
in the | 14 |
Demonstrates the | 9 |
use of | 8 |
the use | 8 |
the source | 7 |
can be | 6 |
HTML Parser | 6 |
source document | 5 |
how to | 5 |
the document | 5 |
HTML source | 4 |
class that | 4 |
Demonstrates how | 4 |
of HTML | 4 |
Built-in functionality | 4 |
badly formatted | 4 |
to be | 4 |
file called | 4 |
functionality to | 4 |
such as | 4 |
which is | 4 |
Jericho HTML | 3 |
well as | 3 |
written to | 3 |
to file | 3 |
tree based | 3 |
Supports document | 3 |
is written | 3 |
simple text | 3 |
for the | 3 |
as well | 3 |
document is | 3 |
with the | 3 |
an event | 3 |
of nodes | 3 |
positions of | 3 |
document structure. | 3 |
nodes in | 3 |
the positions | 3 |
three word phrases repeated minimum three times
Phrase | Quantity |
---|---|
Demonstrates the use | 8 |
use of the | 8 |
the use of | 8 |
in the source | 6 |
Built-in functionality to | 4 |
Demonstrates how to | 4 |
of nodes in | 3 |
Jericho HTML Parser | 3 |
of the document | 3 |
nodes in the | 3 |
positions of nodes | 3 |
the positions of | 3 |
to file called | 3 |
written to file | 3 |
as well as | 3 |
is written to | 3 |
B tags
javadocs
http://sourceforge.net/projects/jerichohtml/
U tags
I tags
x.x
ProgramName
images
file name | alternative text |
---|---|
sflogolocal.png | Jericho HTML Parser at SourceForge.net |
sflogo.php?group_id=101067&type=11 |
headers
H1
Jericho HTML Parser
H2
Features
Sample Programs
Building
Alternative HTML Parsers
H3
Features
Sample Programs
Building
Alternative HTML Parsers
H4
H5
H6
internal links
address | anchor text |
---|---|
privacy-policy.html | privacy policy |
http://jericho.htmlparser.net/ | Jericho HTML Parser |
javadoc/index.html | javadocs |
../release.txt | release.txt |
javadoc/net/htmlparser/jericho/StreamedSource.html | StreamedSource |
javadoc/net/htmlparser/jericho/Source.html#DocumentElementHierarchy | document element hierarchy |
javadoc/net/htmlparser/jericho/Segment.html#getBegin() | begin |
javadoc/net/htmlparser/jericho/Segment.html#getEnd() | end |
javadoc/net/htmlparser/jericho/Source.html#getRowColumnVector(int) | row and column number |
javadoc/net/htmlparser/jericho/Segment.html#findFormFields() | analysis and manipulation of HTML form controls |
javadoc/net/htmlparser/jericho/FormField.html#getValues() | extraction |
javadoc/net/htmlparser/jericho/FormField.html#setValue(java.lang.CharSequence) | population |
javadoc/net/htmlparser/jericho/FormControl.html#setDisabled(boolean) | read-only |
javadoc/net/htmlparser/jericho/FormControl.html#setOutputStyle(net.htmlparser.jericho.FormControlOutputStyle) | data display |
javadoc/net/htmlparser/jericho/TagType.html#register() | registered |
javadoc/net/htmlparser/jericho/TextExtractor.html | extract all text from HTML markup |
javadoc/net/htmlparser/jericho/Renderer.html | render HTML markup |
javadoc/net/htmlparser/jericho/SourceFormatter.html | format HTML source code |
javadoc/net/htmlparser/jericho/Source.html#DocumentElementHierarchy | document element hierarchy |
javadoc/net/htmlparser/jericho/SourceCompactor.html | compact HTML source code |
../samples/console/src/DisplayAllElements.java | DisplayAllElements.java |
../samples/console/src/FindSpecificTags.java | FindSpecificTags.java |
javadoc/net/htmlparser/jericho/StartTagType.html#DOCTYPE_DECLARATION | document type declarations |
javadoc/net/htmlparser/jericho/StartTagType.html#XML_DECLARATION | XML declarations |
javadoc/net/htmlparser/jericho/StartTagType.html#XML_PROCESSING_INSTRUCTION | XML processing instructions |
javadoc/net/htmlparser/jericho/StartTagType.html#SERVER_COMMON | common server tags |
javadoc/net/htmlparser/jericho/PHPTagTypes.html | PHP tags |
javadoc/net/htmlparser/jericho/MasonTagTypes.html | Mason tags |
javadoc/net/htmlparser/jericho/StartTagType.html#COMMENT | HTML comments |
../samples/console/src/ExtractText.java | ExtractText.java |
javadoc/net/htmlparser/jericho/TextExtractor.html | TextExtractor |
../samples/console/src/RenderToText.java | RenderToText.java |
javadoc/net/htmlparser/jericho/Renderer.html | Renderer |
../samples/console/src/HTMLSanitiser.java | HTMLSanitiser.java |
../test/src/samples/HTMLSanitiserTest.java | here |
../samples/console/src/StreamedSourceCopy.java | StreamedSourceCopy.java |
javadoc/net/htmlparser/jericho/StreamedSource.html | StreamedSource |
../samples/console/src/FormControlDisplayCharacteristics.java | FormControlDisplayCharacteristics.java |
javadoc/net/htmlparser/jericho/FormControl.html#DisplayCharacteristics | display characteristics |
javadoc/net/htmlparser/jericho/FormControl.html#setDisabled(boolean) | disabled |
javadoc/net/htmlparser/jericho/FormControlOutputStyle.html#REMOVE | removed |
javadoc/net/htmlparser/jericho/FormControlOutputStyle.html#DISPLAY_VALUE | display value |
../samples/console/src/FormFieldCSVOutput.java | FormFieldCSVOutput.java |
javadoc/net/htmlparser/jericho/FormFields.html#getColumnValues(java.util.Map) | FormFields.getColumnValues(Map) |
../samples/console/src/FormFieldList.java | FormFieldList.java |
javadoc/net/htmlparser/jericho/Segment.html#findFormFields() | Segment.findFormFields() |
../samples/console/src/FormFieldSetValues.java | FormFieldSetValues.java |
javadoc/net/htmlparser/jericho/FormFields.html | FormFields |
../samples/console/src/FormatSource.java | FormatSource.java |
javadoc/net/htmlparser/jericho/SourceFormatter.html | SourceFormatter |
../samples/console/src/CompactSource.java | CompactSource.java |
javadoc/net/htmlparser/jericho/SourceCompactor.html | SourceCompactor |
../samples/console/src/Encoding.java | Encoding.java |
../samples/console/src/SplitLongLines.java | SplitLongLines.java |
../samples/console/src/ConvertStyleSheets.java | ConvertStyleSheets.java |
privacy-policy.html | privacy policy |
external links
address | anchor text |
---|---|
http://sourceforge.net/projects/jerichohtml | Jericho HTML Parser at SourceForge.net |
http://www.webventure.com.au/ | Tweed Coast IT Services |
http://www.eclipse.org/legal/epl-v10.html | Eclipse Public License (EPL) |
http://www.gnu.org/copyleft/lesser.html | GNU Lesser General Public License (LGPL) |
http://sourceforge.net/projects/jerichohtml/ | http://sourceforge.net/projects/jerichohtml/ |
http://sourceforge.net/project/showfiles.php?group_id=101067 | downloads |
http://sourceforge.net/forum/forum.php?forum_id=350025 | support |
http://freshmeat.net/projects/jerichohtml/ | http://freshmeat.net/projects/jerichohtml/ |
http://msdn.microsoft.com/asp/ | ASP |
http://java.sun.com/products/jsp/ | JSP |
http://www.modpython.org/ | PSP |
http://www.php.net | PHP |
http://www.masonhq.com/ | Mason |
http://en.wikipedia.org/wiki/StAX | StAX |
http://www.saxproject.org/event.html | event nor tree based parser |
http://www.w3.org/DOM/ | DOM |
http://www.saxproject.org/ | SAX |
http://lucene.apache.org/java/ | Apache Lucene |
http://htmlparser.appspot.com/samples/FormatSource.jsp | Click here for an online demonstration |
http://htmlparser.appspot.com/samples/FormatSource.jsp | Click here for an online demonstration |
http://www.quiotix.com/downloads/html-parser/ | http://www.quiotix.com/downloads/html-parser/ |
http://jtidy.sourceforge.net/ | http://jtidy.sourceforge.net/ |
http://htmlparser.sourceforge.net/ | http://htmlparser.sourceforge.net/ |
http://www.apache.org/~andyc/neko/doc/html/index.html | http://www.apache.org/~andyc/neko/doc/html/index.html |
http://www.webventure.com.au/ | WebVenture.com.au |
http://www.corporatetranslations.com.au/ | Corporate Translations |
http://www.takingcareoftrees.com.au/ | Taking Care of Trees |
http://sourceforge.net/projects/jerichohtml | Jericho HTML Parser at SourceForge.net |