Wednesday, September 28, 2011

If you are new to Web Mining…

If you are selling products and services via the web channel, you may consider analyzing who is visiting your web site, how people who buy differ from those who don’t, and, among those who buy, what their clickstream sequence and navigational pattern look like.
Each customer's action on a website generates data: not just high-level interactions such as buying something, but also something as simple as using a search engine or navigating through a site. All these interactions between digital service providers and the consumer can be recorded and stored in digital databases. These large data sets contain information helpful to business marketing strategies, both for retrospective analysis and for data-driven forecasting.

Companies today are in the unprecedented position of being able to collect vast amounts of customer information relatively easily. By using web mining, companies can analyze and predict the behavior of their customers. All web site visitors leave digital trails, which web servers automatically store in log files. Web analysis tools analyze and process these web server log files to produce meaningful information. Essentially, a complete profile of site traffic is created, which shows, for example, how many visitors there were to the site, what sites they came from, and which pages on the site are most popular. Web analysis tools provide companies with previously unknown statistics and useful insights into the behavior of their online customers. While the usage and popularity of such tools may continue to increase, many online retailers are now demanding more useful information about their customers from the vast amounts of data generated by their web sites.
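
To make the kind of traffic profile described above concrete, here is a minimal Python sketch. It assumes the server writes the widely used Combined Log Format; the sample log lines, host names and the function name `summarize_traffic` are made up for illustration, and counting distinct client addresses is only a rough stand-in for counting visitors.

```python
import re
from collections import Counter

# Assumed layout (Combined Log Format):
# host ident user [time] "method path protocol" status bytes "referrer" "user-agent"
LOG_PATTERN = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)


def summarize_traffic(lines):
    """Count distinct client hosts, referring URLs and requested pages."""
    hosts = set()
    referrers = Counter()
    pages = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip lines that do not follow the assumed format
        hosts.add(match["host"])
        pages[match["path"]] += 1
        if match["referrer"] not in ("", "-"):
            referrers[match["referrer"]] += 1
    return {"visitors": len(hosts), "referrers": referrers, "pages": pages}


# Hypothetical sample data, not taken from a real server.
SAMPLE_LOG = [
    '192.0.2.1 - - [28/Sep/2011:10:00:00 +0000] "GET /index.html HTTP/1.1" '
    '200 512 "http://search.example/?q=shoes" "Mozilla/5.0"',
    '192.0.2.1 - - [28/Sep/2011:10:01:00 +0000] "GET /products.html HTTP/1.1" '
    '200 2048 "http://shop.example/index.html" "Mozilla/5.0"',
    '192.0.2.2 - - [28/Sep/2011:10:02:00 +0000] "GET /index.html HTTP/1.1" '
    '200 512 "-" "Mozilla/5.0"',
    'this line is not a log entry',
]

summary = summarize_traffic(SAMPLE_LOG)
print(summary["visitors"])
print(summary["pages"].most_common(1))
print(summary["referrers"].most_common())
```

Note that everything this sketch reports is server activity (hosts, pages, referrers); it says nothing yet about who the visitors are or whether they bought anything.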

Organizations have typically invested large amounts of money into developing their web sites and web strategy, and they would like to know what return they are receiving on their investment. Most sites use hits and page views as the measure of the web site's success, which clearly is not going to answer their questions. A website is commonly used for:

- Selling products/services
- Providing product/company information
- Providing customer support

Typical questions that an e-retailer needs to answer are:

- How to increase the browser-to-buyer conversion rate?
- How to increase the web retention rate? (Defined as the ratio of the number of browsers who return to the web site within a certain window of time to the total number of browsers.)
- How to reduce the clicks-to-close value? (A smaller number indicates that customers are finding what they are looking for more easily. Personalization of web services is one suitable approach to reducing this value.)
- Does the web site design satisfy the needs of various customer segments?
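
The first three measures in the list above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Session` record, its field names and the sample data are hypothetical, a "return" is taken to mean a later session within the window after the visitor's first session, and clicks-to-close is taken as the average number of clicks in sessions that end in a purchase.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Session:
    # Hypothetical record; a real site would derive this from its own logs.
    visitor_id: str
    start: datetime
    clicks: int
    purchased: bool


def conversion_rate(sessions):
    """Share of distinct visitors (browsers) who bought at least once."""
    visitors = {s.visitor_id for s in sessions}
    buyers = {s.visitor_id for s in sessions if s.purchased}
    return len(buyers) / len(visitors) if visitors else 0.0


def retention_rate(sessions, window):
    """Share of visitors who come back within `window` of their first visit."""
    first_seen = {}
    returned = set()
    for s in sorted(sessions, key=lambda s: s.start):
        if s.visitor_id not in first_seen:
            first_seen[s.visitor_id] = s.start
        elif s.start - first_seen[s.visitor_id] <= window:
            returned.add(s.visitor_id)
    return len(returned) / len(first_seen) if first_seen else 0.0


def clicks_to_close(sessions):
    """Average clicks in purchasing sessions, or None if nobody bought."""
    closing = [s.clicks for s in sessions if s.purchased]
    return sum(closing) / len(closing) if closing else None


# Made-up sample data for illustration.
SAMPLE_SESSIONS = [
    Session("a", datetime(2011, 9, 1), clicks=5, purchased=False),
    Session("a", datetime(2011, 9, 3), clicks=4, purchased=True),
    Session("b", datetime(2011, 9, 1), clicks=8, purchased=False),
    Session("c", datetime(2011, 9, 2), clicks=6, purchased=True),
    Session("d", datetime(2011, 9, 2), clicks=3, purchased=False),
    Session("d", datetime(2011, 10, 20), clicks=2, purchased=False),
]

conversion = conversion_rate(SAMPLE_SESSIONS)
retention = retention_rate(SAMPLE_SESSIONS, timedelta(days=7))
avg_clicks = clicks_to_close(SAMPLE_SESSIONS)
print(conversion, retention, avg_clicks)
```

None of these figures can be derived from raw hit counts alone; each needs visits to be tied to a visitor and to an outcome.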

Using page hits will NOT provide an answer to any of these questions. Current traffic analysis tools are geared toward providing high-level predefined reports about domain names, IP addresses, browsers, cookies and other machine-to-machine activity. These server activity reports simply do not provide the type of bottom-line analysis that e-tailers, service providers, marketers and advertisers in the business world have come to demand. These software packages (i.e., web analysis tools) originated from the need to report on the activity of the web server, not on the activity of the user.

Web mining may be subdivided into:
- Web-content mining
- Web-structure mining
- Web-usage mining
- User profile data

Web-content mining is the mining of Internet pages, common in the next generation of XML/RDF-based search engines and Web spiders.
Web-structure mining is the application of data mining to reconstruct the structure of a Web site or sites.
Web-usage mining is the mining of log files and associated data from a particular Web site to discover knowledge of browser and buyer behavior on that site. User profile data, such as demographic information about the users of the web site, registration data and customer profile information, can provide valuable information about a site's customers and can be a platform for segmentation and profiling. Web-usage mining is what is widely understood to be web mining, and it is the main subject of this introduction.
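
As a rough illustration of web-usage mining, the Python sketch below groups click events into sessions and then compares the navigation paths of buyers with those of non-buyers. The event tuples, the 30-minute inactivity cutoff and the `/checkout/complete` page are all assumptions chosen for the example, not something prescribed here.

```python
from collections import Counter, defaultdict
from datetime import datetime, timedelta

SESSION_TIMEOUT = timedelta(minutes=30)  # assumed inactivity cutoff
CHECKOUT_PAGE = "/checkout/complete"     # hypothetical "purchase done" page


def sessionize(events):
    """Split (visitor_id, timestamp, path) events into per-visit page lists."""
    by_visitor = defaultdict(list)
    for visitor_id, timestamp, path in events:
        by_visitor[visitor_id].append((timestamp, path))

    sessions = []
    for clicks in by_visitor.values():
        clicks.sort()  # chronological order within one visitor
        current = [clicks[0][1]]
        for (prev_ts, _), (ts, path) in zip(clicks, clicks[1:]):
            if ts - prev_ts > SESSION_TIMEOUT:
                sessions.append(current)  # long gap: start a new session
                current = []
            current.append(path)
        sessions.append(current)
    return sessions


def path_counts(sessions):
    """Count navigation paths separately for buying and non-buying sessions."""
    buyers, browsers = Counter(), Counter()
    for pages in sessions:
        target = buyers if CHECKOUT_PAGE in pages else browsers
        target[tuple(pages)] += 1
    return buyers, browsers


# Made-up clickstream for illustration.
t0 = datetime(2011, 9, 28, 10, 0)
EVENTS = [
    ("a", t0, "/home"),
    ("a", t0 + timedelta(minutes=2), "/product"),
    ("a", t0 + timedelta(minutes=5), CHECKOUT_PAGE),
    ("b", t0, "/home"),
    ("b", t0 + timedelta(minutes=1), "/search"),
    ("b", t0 + timedelta(hours=3), "/home"),
]

sessions = sessionize(EVENTS)
buyer_paths, browser_paths = path_counts(sessions)
print(buyer_paths.most_common(3))
print(browser_paths.most_common(3))
```

On real data the same two counters could be joined with user profile data to see which customer segments follow which paths.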

Goran Dragosavac


  1. Thanks for sharing your info. I really appreciate your efforts and I will be waiting for your further write ups thanks once again.

  3. If you are new to web mining and you need data from other websites, there are many web mining or web scraping companies which provide any kind of data to help you in business analysis. Loginworks Software has expertise in web mining, so contact Loginworks on :

  5. This information you provided in the blog that was really unique I love it!!, Thanks for sharing such a great blog