According to Management Information Systems, “Data mining tools are the software tools you use to query information in a data ware house”.

The concept of Data mining is used decades ago in different industries and incredible progress as the years goes on that meets the needs of the user in decision making.
Evolution of business data to business information are Data Collection (1960s), Data Access (1980s), Data Warehousing and Decision Support (1990s) and last but not the least Data Mining (emerging today). Businesses collected data from surveys, research information and analyzed information and previous transactions that are large quantities inputted in databases. As the needed information of every business arises, the evolution of business data creates a way to diversify the information into a specific detail with a high percentage of accuracy and rapid response.
The Scope of Data Mining
Thearling discussed about the new business opportunities take place because of the two capabilities of data mining:
-Automated prediction of trends and behaviors, this answers can be easily detects from the data quickly even in the large databases.
-Automated discovery of previously unknown patterns, this data mining tools classify previously hidden patterns in one step in the databases.
This is very effective in Statistical Qualitative Control to avoid having deficiencies and errors in the production area. It will give a better forecasting that will minimize cost and will make a gain to compete in competitive advantage. Using the previous data, the abnormality will be easily detected based on the unknown patterns and having this information, management or operation management will immediately analyze the cause and decide which solution or strategy will be executed to the given situation.
A Database needs a reliable and appropriate Hard disk drive, RAM (Random Access Memory), ROM (Read Only Memory) so that it will perform well. The faster it works the more chances for the users to experiment and test all the data mining tools to get the appropriate solution and easily understand complex data.
According to Gartner Group Advanced Technology Research Note, “With the rapid advance in data capture, transmission and storage, large-systems users will increasingly need to implement new and innovative ways to mine the after-market value of their vast detail data, employing MPP [massively parallel processing] systems to create new sources of business advantage.”
The concept of data mining is known before and much more when the evolutions of technology boost in the market. The demand of the use of data mining is high and the development of this computer tools is continuous. Especially to the large business and organization, data mining tools makes the decision making easier to decide and quick to achieve the result.
There are different kinds of data mining tools available in the market and after choosing what data mining tools to use, building appropriate model is important.
Ever since building model are applied before the beginning of computers or data mining technology. Building model will help to know the answer in the certain situation using the data you have. If the formulated model is good, there is a high chance of getting the optimal answer you want to apply in the current situation.

An Algorithm is a step by step instructions created by the users to solve a problem in a specific situation. While in the concept of data mining, algorithms are formulated to find relations and in sets of data in a database.
Excerpted from the book Building Data Mining Applications for Customer Relationship Management by Alex Berson, Stephen Smith, and Kurt Thearling.
Two sections of the most common data mining algorithms use today:
-Classical Techniques: Statistics, Neighborhoods and Clustering
These techniques are widely used since early 1980. Statistics is a mathematical approach that is known to its probability which predicts the possibilities of outcomes that might occur or takes place. Neighborhoods is a technique that shows the similarities in the databases. While Clusters allows the data to group together based on the needed information of the user.

-Next Generation Techniques: Trees, Networks and Rules
It is the most common techniques used that build a predictive models, this leads to discover new information. Decision Tree breaks down the information in the databases that will show the useful information to the question you want to answer. Networks can detect patterns or trends in connected networks or nodes that can solve difficult problems and create decisions. A Rule has limitation to gather the needed data with the use of “if this and this and this then this”.
DATAMIN course and our Professor Mr. Duremdes at De La Salle Canlubang help us to somehow relates in deciding data mining tools to use in a certain situation in I.T perspective. Data mining tools are quick to run and give result at the minimum time, to determine the best techniques, trial and error can be done to see which techniques to use however decisions are made by the user whether to use the result in data mining tools or not. Knowledge worker should choose the most appropriate data mining tools based on what information in the database is all about, when to use the chosen techniques which leads to a competitive advantage, why it is needed in the business or organization and who will create and apply the data mining tools.
No comments:
Post a Comment