3.1 Critically discuss and illustrate using a diagram the steps involved in data mining when viewed as a process of knowledge discovery. (20)
3.2 State and explain the challenges to data mining regarding data mining methodology and user interaction issues. (10)
Question 3.1
Data mining as a knowledge discovery process involves the following steps:
Data cleaning:
· This is the process of removing impurities and inconsistencies in the raw data that has been mined.
Data integration:
· In this step, data collected from multiple sources is merged together to form a data storage commonly known as a database.
Data selection:
· This process involves analysis of data that is retrieved from the data warehouse.
Data transformation:
· This step involves creating data into forms that are good for mining and other operations
Data mining:
· This process involves application of appropriate methods to understand good patterns of data
Pattern evaluation:
· This is used to identify the patterns that are got from the previous step of data mining
Presentation of Knowledge:
· This is where knowledge that has been mined is visualized using many techniques to the knowledge that the end user can comprehend.
Question 3.2
Challenges to data mining regarding data mining methodology and user interaction issues
· Sometimes the data is incomplete and has very many inconsistencies making it hard to comprehend it.
· Data mining may be faced by an issue of complex data that makes it hard to mine and interpret for the final user.
· There are no well-established and efficient algorithms that can be used to mine data efficiently.
· There are many security and social challenges in the world of data mining.
· Incorporating background knowledge that helps guide discovery process for patterns.
Comments
Leave a comment