Primary Content Block Detection from Web Page Clusters through Entropy and Semantic Distance | IEEE Conference Publication | IEEE Xplore