• Fix some bugs about HTML to article.
  • Improving the efficiency of extract.