Contributors

 
Derrick Oswald
Software Development Manager
Autodesk Inc.
427 Laurier Avenue West
Suite 500
Ottawa, Ontario, Canada
K1R 7Y2
(613) 755-5065
http://www.autodesk.com
email
Voice : 613.755.5065
On Derrick Oswald

Derrick is the lead programmer and project manager of HTMLParser.

He started using the HTMLParser for an email message application (http://sourceforge.net/projects/connector) when he discovered the javax.swing.text.html.HTMLEditorKit just wouldn't cut it. Since then he has added character set processing functionality and URL connection handling as well as beans for text/link extraction and done some bug fixes.

Derrick has been programming for the last 17 years, and in java since version 1.0 first came out. He is a Sun Certified Developer and proselytizes Java development wherever possible.

Derrick on Canada

To Americans, Canada is the great white north. It's a vast expanse of land with only as many people as California. There is however, a surprising amount of activity in the high tech sector, especially in the capital region where about a million people live.

I love the spring, when I can ride my bike to work, the air is warm and the tap water is still ice cold. It's a great place to bring up kids (I have two) and offers a relaxed multi-lingual culture of fine art, athletics (go Senators!), industry, recreation, education and friendship.

Come and visit one February for Winterlude, but bring your woolly long johns.


Somik Raha
Extreme Programmer & Coach
Industrial Logic Inc.
2583 Cedar Street, Berkeley,
CA USA 94709
http://www.industriallogic.com
http://www.industrialxp.org
Personal Home Page
email
Voice : 510.540.8336
On Somik Raha

Somik was the founder and lead programmer/project manager of HTMLParser since its inception in April 2001 till April 2003. He started the project while he was working at Kizna Corporation, Japan. He is thankful to Kizna for allowing him to open source the parser, as it was not what Kizna was making.

Thereafter, hundreds of people wrote back, asking for modifications, giving design suggestions, bug reports, etc. and making the parser what it is today. The parser is truly a collaborative effort with people from different countries, companies, cultures.

Somik on India

India is a massive multi-cultural nation, home to over a billion people, a hustling and bustling democracy.

India has 27 states, and over 40 languages and dialects. Don't be surprised if you find two Indians communicating with each other in English - for that might be the only common language they share.

Food in India is awesome, with each state having its own cuisine. India's population includes Hindus, Muslims, Christians, Jains, Buddhists, Jews, Parsees and many others. Two famous Indian exports are : Yoga and Software :)


Joshua Kerievsky
Founder, Extreme Programmer & Coach
Industrial Logic Inc.
2583 Cedar Street, Berkeley,
CA USA 94709
http://www.industriallogic.com
http://www.industrialxp.org
email
Voice : 510.540.8336
On Joshua Kerievsky

Joshua drove the refactoring and design-review effort in the parser, which vastly improved the design. He has harvested refactorings from the htmlparser project for his forthcoming book. The visitor approach was introduced by him.

Joshua Kerievsky has been programming professionally since 1987, and is the founder of Industrial Logic, a company specializing in Extreme Programming (XP). Since 1999, Joshua has been coaching and programming on small, large and distributed XP projects and teaching XP to people throughout the world. He is the author of numerous XP and patterns-based articles, simulations and games, including the forthcoming book, Refactoring to Patterns.



Kaarle Kaila
Software Developer - Consult.
Kalenteritie 23 B 4
02200 Espoo, Finland
tel: +358-50-3725844
http://www.kk-software.fi
email
On Kaarle Kaila

Kaarle was one of the first users of the parser. He contributed a critical method - parseParameters(). This method is present today in the HTMLParameterParser class and it picks up all the attributes in a given tag.

This method is surprisingly hard to write - as it handles various kinds of dirty html available, and it has proved to be the single most used method in the parser.

Kaarle has used the parser in the J2EE extranet project at Nokia.

Kaarle on Finland

Finland might today be best known for Nokia but we are not only that. Did you know that according to Forbes magazine the richest (imaginary?) person lives in Finland. Who might that be? Hint: he travels with reindeer and he is most active in christmas time. Perhaps you knew that the reindeer is a common domestic animal in northern Finland (Lapland).

There's about 5 million of us here. We have more than 60 000 lakes and much more small islands both in the lakes and on the seaside. Some of us (like me) have their own island. You can find a picture of my island on my webpage if you look for it. Of course I have a sauna on the island.


Claude Duguay
Arcessa, Inc.
10210 NE Points Drive
Suite 310
Kirkland, WA 98033
http://www.arcessa.com/
email
On Claude Duguay

Claude made very important contributions to the parser, by doing massive scalability tests- bombarding the parser with over 5 million lines of html. The reports from his tests contributed greatly in improving the scalability of the parser.

Claude also contributed greatly to the design of the parser - giving it a professional touch. Some of his important contributions include chained exceptions and the feedback mechanism.

Claude Duguay has developed software for more than 20 years. He is passionate about Java, reads about it incessantly, and writes about it as often as possible.

 


Dhaval Udani
Senior Analyst
M-Line, QPEG
OrbiTech Solutions Ltd.
+91-22-28290019
Extn. 1457
http://www.orbitech.co.in
email

On Dhaval Udani

Dhaval has made many valuable contributions to the parser. He helped stabilize the script scanner, provided numerous bug reports, and contributed the input, select, option and textarea scanners.

He is also responsible for providing standardized end-of-line support - thus making the parser usable across Windows and Linux - which have different conventions for end-of-line characters.

Read Dhaval's article on The Quest for HTMLParser.

I've been passionate about computers from very early on. Started working
with computers very early on, but have been doing serious programming
only in the last 3 years after finishing my bachelors in computer engineering.

I am a pretty recent entrant to Java programming with just
1.5 years of experience on it but have totally fallen in love with it
and have remained involved ever since. The kick you get when a bug comes
up and you have to rack your brains to solve it is a high as none other.



Alberto Nacher
Software Developer - Consultant
Corso Sebastopoli 39,
10134 Torino, Italy
Personal Home Page
email
On Alberto Nacher

I was born in 1972, I'm a computer engineer and I have been working as consultant since 1998.

I've worked with Microsoft VB and VB.NET technologies, with Java technology and Livelink technology (knowledge management and developer enviroment of OpenText company).

My hobbies: travelling, seeing football matches, going out with friends, getting mushrooms, reading and this year also an English course!

Alberto on Italy

Italy is not so important if seen by high technology point of view. The main activities in my country are fashion, car development (FIAT, Ferrari, Alfa Romeo), pasta and food, wines and, of course, the big state companies doing telecommunication systems, electrical distribution, oil distribution. So... If you want to work as programmer you have no relevant software houses to join with and it is better being a technical consultant.

Anyway... If you want to visit Italy, you surely be charmed by the beauty of my country! Venice, Florence, Rome are some of the best towns in the world. But you can also visit Torino (my home town) where you can see the 2nd Egyptian museum in the world.


Dr. Sam Joseph
http://www.neurogrid.com
sam@neurogrid.com

On Sam Joseph

Sam Joseph is the creator of the Neurogrid Project, and uses HTMLParser in the Neurogrid.

Sam gave valuable comments on the design of the HTMLParser - which have been greatly appreciated by many users. The idea of having a toHTML() method was his. Along with the ideas of toPlainString(), ..., supporting meta tags,..

Sam is into the mobile market and consults to several companies in Japan in this field. He also does research projects at Tokyo University.

 

Cédric Rosa
email
Cédric was one of the most prolific testers of the parser, coming up with several bug reports daily. That went a long way toward making the parser really stable.  

Raghavender Srimantula
king_chiru@hotmail.com
Raghav provided the Form and Frame scanners, apart from providing valuable bug reports.  

Roger Sollberger
rrabbit@gmx.ch

Roger provided valuable bug reports, and helped to improve the StringNode parser - making it handle tag characters correctly.

Roger is from Switzerland, and used the parser in his diploma thesis.

 

Allen L. Fogleson
Senior Project Manager Crunchy Technologies
2111 Wilson Boulevard Suite 350
Arlington, Va 22201
http://www.crunchy.com/
EMail : afogleson@crunchytech.com
Voice : (703) 469-2032
Pager : (800) 826-3181

Allen was the first person to write back about the parser - his comments were a source of inspiration for us to go forward and make this a real product.

He contributed the scanners for handling JSP and ASP tags.

 
Annette Doyle
Annette.Doyle@ausinfo.com

Annette has provided bug reports and fixes - particularly in scanning image tags.

 

Nash Tsai
nash@pocketnet.com.tw
Nash helped with optimization and debugging issues.  

Taras Bendik
falcorus@mt.net.mk

Taras has helped with internationalization issues - handling of unicode.

Taras has also helped in testing, providing good bug reports and fixing them.

Taras is from the land of Alexander, the Great - Macedonia.

 


Arnaud Brejeon
Research Engineer
Gentech Corp
http://www.gen.co.jp
arnaud@gen.co.jp

Arnaud Brejeon is a research engineer in the field of Computer Vision, who shares a passion in Java, when he has some free time away from C++.

Arnaud spent valuable sessions pair-programming with Somik on the HTMLTag class- refactoring it - and providing a model of a class which was used for all the other base parsing classes.

 


Joe Ryburn
Technical Director
Lead Router LLC
Office 501-221-8865
Mobile 501-249-5015
http://www.leadrouter.com
joe@leadrouter.com

Joe gave an important suggestion - something that we had overlooked - handling the BASE tag while parsing links. He also submitted valuable bug reports that helped us improve the form and link scanners.  

Rodney S. Foley
email

Rodney made an important contribution to this project. HTMLParser initially used to have a complex mechanism of auto-registering scanners. Rodney first suggested that this should be done away with, as it was confusing.

This single suggestion helped simplify the design of the parser.

Rodney has also kindly offered to register and hold the htmlparser.org domain name and forward traffic to the SourceForge project page.

 

Thanks to Manuel Polo, Enrico Triolo, Gernot Fricke, Nick Burch, Stephen Harrington, Domenico Lordi, Kamen, John Zook, Cheng Jun, Mazlan Mat, Rob Shields, Wolfgang Germund, Raj Sharma, Robert Kausch, Gordon Deudney, Serge Kruppa, Roger Kjensrud, and Manpreet Singh for suggestions, bug reports and feature ideas.

Thanks to Jon Gillette for the cool new logo.