xtim
Thursday, January 08, 2009
 
Broken characters
Lots of imports today.

Also found a bug in RubberStamp which caused text extraction to stop on the first page that contained a non-UTF8-expressible character. I think the file in question was actually a little broken (odd charmap?) as I can't think there are many occasions when the text of a page wouldn't be expressible in UTF-8. It was an embedded ad so something had probably got broken higher up in the production process. Now fixed. We just screen out unencodable characters as they're processed.

Also, replaced the waste plumbing on the bath to fix a leak. Lessons learned:


  1. Don't expect to be able to re-use old compression joints (the ones without a screw collar)

  2. All 40mm pipes are not the same diameter

  3. Leave as much of the original plumbing in place as possible, with an old pipe as the connection point. You can get universal connectors to fit old pipes, but not universal pipes to fit old connectors



T
Comments: Post a Comment

<< Home

Powered by Blogger