svn merge -c 1297274 from trunk to branch-0.23.2 FIXES HADOOP-8064. Remove unnecessary dependency on w3c.org in document processing (Kihwal Lee via bobby)
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-0.23.2@1297280 13f79535-47bb-0310-9956-ffa450edef68