Abstract
1- INTRODUCTION
2- WEB STRUCTURAL MINING
3- WEB CONTENT MINING
4- MULTI-MEDIA MINING
5- CONCLUSION AND FUTURE WORK
REFERENCES
Abstract
With the rapid development of Internet, we have entered an era of information explosion, there is a lot of redundant information in the Network. How to extract a useful part of this information from the massive information resources, analyzing the vast amount of information and finally get the potential knowledge we want to extract. Web mining technology came into being, and saved out the human from the information ocean. This paper will analyze the realization of Web content mining and Web structure mining, their basic algorithm principles and their application areas.
INTRODUCTION
What is Web mining? It is the process that discover and extract the useful mode and knowledge that people are interested from the massive Web documents and activities through data mining technology [1]. Compared to the wellknown Data mining, Web mining can be extended to a deeper and broader areas, the differences between them are also very obvious: the object of data mining is the data stored in database, that is to say, the structured data; Web Mining aims at the contents or structure of Web document, which has a feature of wide-distributed, dynamic and heterogeneous, and contains unstructured or semi-structured data. Based on the diversity of information on the Web, Web mining is divided into the following category as shown in figure 1: Web structure mining, Web content mining and Web usage mining [2]. These three mining methods are different in the aspect of dealing with the main data, processing methods and application areas.