JTidy does not correctly parse HTML documents that contain nested comments. See this URL for an example: http://www.cnn.com/2011/WORLD/africa/06/13/somalia.mastermind/index.html?hpt=hp_t1
These lines are found in the head:
<![endif]-->
Everything from the nested opening comment through the end of the document ends up in a single node of the parsed DOM.
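
For comparison, here is a rough sketch of the behavior I would expect, using only the Java standard library. It is not JTidy code and does not reproduce the bug itself. It assumes the common rule that a comment ends at the first "-->", so a "<!--" inside an open comment does not start a second one. The class name CommentSpans, the method comments, and the sample head fragment are all made up for illustration (the fragment is not copied from the CNN page), and edge cases such as "<!-->" are ignored.

```java
import java.util.ArrayList;
import java.util.List;

public class CommentSpans {

    /**
     * Returns the text of each comment in the input.
     * Assumption: every comment ends at the first "-->" after its "<!--",
     * so a nested "<!--" is just part of the comment text.
     */
    static List<String> comments(String html) {
        List<String> out = new ArrayList<>();
        int pos = 0;
        while (true) {
            int open = html.indexOf("<!--", pos);
            if (open < 0) {
                break; // no further comments
            }
            int close = html.indexOf("-->", open + 4);
            if (close < 0) {
                // Unterminated comment: only here does the rest of the input belong to it.
                out.add(html.substring(open + 4));
                break;
            }
            out.add(html.substring(open + 4, close));
            pos = close + 3; // resume scanning after the terminator
        }
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical fragment with a comment opened inside a conditional comment.
        String html = "<head><!--[if IE]><!-- inner --><![endif]-->"
                + "<title>t</title></head><body><p>x</p></body>";
        for (String c : comments(html)) {
            System.out.println("[" + c + "]");
        }
    }
}
```

With input shaped like this, the sketch closes the comment at the first "-->" and leaves the title and body outside it, which is what I would expect the DOM from JTidy to reflect as well.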