On a cohesive focused and path-ascending crawling scheme for improved search results – F. U. Ogban, P. O. Asagba, Olumide Owolabi
The quality of the results collections, displayed to users of web search engines today still remains a mirage with regards the factors used in their ranking process. While the results are often amusing and expand users’ horizons, they are often frustrating and consume precious time. The most important measure of a search engine is the quality of its search results. In this work, we combined the Focused crawling method developed by (Chakrabati; 1999) and Path-ascending crawling method of (Cothey; 2004), to create a hybridized method. Our major objective is to unify into one search system, the path-ascending and focused crawling methods to get into crawling the deep web at an efficient rate for a better recall and precision. The path-ascending and focused crawling systems respectively were aimed at producing increase recall and the function of similarity of a query to a matched document. The unification now is to help produce additionally the most authoritative pages in a search result and therefore increasing the precision thereof, thereby given enhance/increase authority and the strength of search/match of given queries to their corresponding documents in the web. The methodologies adopted are the Document and Query Likelihood Models. Documents or corpora of known measures in query types, recalls and precision from the Text Retrieval Conference (TREC), the Initiative for the Evaluation of XML Retrieval (INEX) and REUTERs Collection, were used as work bench for evaluation of the system. We obtained favorable results, which showed significant improvement from results if implemented using the Focused crawling method and path-ascending method as a single entity.