peknet :: an eddy in the bit stream         
about peknet


peknet is Peter E Karman. you'll find here musings on technology, politics, religion, books, beer and frequent references to my beautiful sons.

navigate

credits

Powered by Swish-e

Valid CSS!

proud member of the
Open Source Community

© 2005 peknet dot com

syndicate this site

HTML Highlighting

I can't count the hours I've spent hacking at a foolproof highlighter for HTML. But I'm nearing a really good approximation of foolproof. I've posted HTML::HiLiter to the Perl CPAN.

The really hard thing about this was creating a regular expression that is fast enough to be useful but accurate enough to work 99% of the time. I ended up using the HTML::Parser module, which is 'fast enough' and very powerful, due to the embedded C code and some good design. I've also looked at HTML::Tree but because HTML::Parser was a standard module in Perl 5.6.x it makes more sense to me right now to use a widespread standard. It increases the chance that folks might find HTML::HiLiter useful.

The most recent version (0.11) is due to get posted soon. I'm excited about it: I've improved the speed and accuracy, and added several features to help support my other recent project: SWISH::HiLiter -- an extension to the SWISH::API class.

Both these projects are open source and come out of my Cray work on CrayDoc. A huge project for me, and a real learning experience: character encodings, HTML syntax, and the power of Perl regular expressions. I'd wager that my Perl skills increased %500 as a result of this project.

If you use it, let me know what you think.

File under projects Thu Oct 28 22:21:51 CT 2004


2008 entries » 2007 entries » 2006 entries » 2005 entries » 2004 entries »
« July  August 2008  Sept »
Sun Mon Tue Wed Thu Fri Sat
         
           
weather
Saint Paul, MN
68 F (20 C)
Fair

Sunrise/Sunset
Updated:
Aug 28, 10:53 pm CDT

worth reading