Download the full report on Data Mining.
A brief introduction to data mining can be found here.
Product/Company | Description | Applications | Business Size | Skills |
---|---|---|---|---|
11Ants Model Builder | This is an Excel add-on that hides much of the complexity associated with data mining. Various models are created in the background and the most viable models are automatically selected. Moderate pricing and the availability of the 11Ants Predictor scoring engine make it suitable for many types of application. | Generic tool suitable for many types of application. 11Ants Analytics also provides customer churn and response data mining applications. | Small to large. | User Analyst |
Angoss | Angoss provides a broad product set encompassing data mining and business intelligence. Ease of use is a main feature of the product set and rich graphical user interfaces are the order of the day. KnowledgeSTUDIO - advanced modelling including neural networks and unsupervised learning techniques. Includes market basket analysis. KnowledgeExcelerator - a tool for visual data discovery (finding patterns and summary statistics) and analysis of predictive variables. | Generic toolset, but with some vertical solutions (banking, high tech, insurance and retail), and a cloud-based sales and marketing solution. | Medium to Large | Analyst User |
Bayesia | A distinctive set of capabilities through the exploitation of Bayesian Networks. Supports the whole model creation process and the integration of models into production systems. Bayesia Market Simulator supports the analysis of competing offers to a defined population; other products support powerful graphing and application integration. | These novel techniques are applicable to any industry and Bayesia has experience in most of them. | Medium to large. | Analyst |
DataDetective | Sentient Information Systems profiles DataDetective as an end-user tool with prediction, clustering, profiling, network analysis, graphical displays and fuzzy matching. Unstructured data types can be included in analysis. | A general purpose end user tool that has been used in banking, insurance, government departments and police forces. | Small to large. | User |
Data Mining | The Rule Induction Kit finds decision rules from data, and Enterprise Data Miner is a collection of programs for mining big data. The latter includes data preparation, data reduction and sampling, and prediction. | A generic toolkit suitable for most applications | Medium to Large | Analyst |
DataminerXL | An Excel add-on which supports the creation of models using regression, naive Bayes, neural networks, and support vector machines. Not an end-user product, but very good for those with some familiarity with the territory. Modest pricing and a free version throttled to 1000 instances. | Generic tool for many data mining problems. | Small to medium. Large organizations might use it to prototype. | Analyst |
Estard Data Miner | This is profiled as an end user tool employing various wizards to provide ease of use. It supports decision rules and a statistics module. | General purpose but with particular applicability to marketing applications. | Small to large. | User |
FastStats | Apteco provides a whole range of tools for marketers including FastStats modeling. This employs decision trees, clustering and a patented method called Predictive Weight of Evidence. FastStats Discover is a visual tool for analysis and visualization. | Specifically targeted at the marketing function. | Medium to Large | User |
FICO | Model Builder covers the whole predictive model life-cycle with regression, logistic regression and neural networks. Large number of data transformation options. FICO also provides optimization solutions through linear and non-linear programming tools. | Broad applicability, but FICO has particular experience in financial services, retail, government and healthcare. | Large | Analyst |
GenIQ Model | Uses genetic programming to carry out a superior form of regression analysis. Automatically performs variable selection and specifies the model. | General purpose tool with particular applicability to marketing. | Medium to Large | Analyst |
GhostMiner | Fujitsu's GhostMiner assists with data preparation and model validation, and supports multimodels such as committees or k-classifiers. A rich graphical interface is provided. | General purpose tool used in database marketing, sales, credit scoring, fraud detection and HR. Primary target audience is banking, telecoms and distribution companies. | Medium to Large | User Analyst |
JMP | JMP from SAS is primarily a highly visual statistical analysis tool, but with neural networks and clustering. | Widely used in science, research, engineering and marketing. | Medium to Large | User Analyst |
Knowledge Miner | Uses an interdisciplinary approach to creating self-organizing models. In their words: 'an inductive, statistical learning network technology using the cybernetical approach of self-organization including systems, information and control theory and computer science'! | Used by many large organizations in science, engineering and finance. | Medium to Large | User Analyst |
KXEN | This is an enterprise tool majoring on the use of support vector machines. There are several components for streamlining the model creation process through to deployment in production. Explorer streamlines the data preparation process, Modeler is a high-level modeling environment and Scorer supports deployment of predictive models into production. | Mainly targeted at predictive analytics, with specific tools for social network analysis and Genious, which provides marketers with their own toolset. A cloud-based offering is targeted at sales and marketing functions and specifically Salesforce users. | Medium to large. | User Analyst |
Nuggets | Uses non-statistical methods for building, validating and applying models. The technology boasts a high tolerance for noisy data and high levels of productivity. Comes in two versions - Nuggets Pro for the desktop and Nuggets Enterprise via client/server. | General purpose tool, but often used in marketing, finance and CRM applications. | Medium to Large | User Analyst |
Oracle Advanced Analytics | Oracle R Enterprise supplies the R library to the Oracle environment. Oracle Data Mining implements data mining algorithms that run as native SQL in the database. | The full functionality of R means that most applications can be addressed. | Large | Analyst |
Pentaho | Business intelligence and data mining capability, the latter being an implementation of Weka. Support for Hadoop. | Generic capability. | Medium to Large | Analyst |
PolyAnalyst | Analysis of both structured and unstructured (specifically text) data, with report templates for presentation to management. | General purpose tool with many large customers in government, insurance, financial services, manufacturing and other verticals. | Medium to Large | Analyst |
Predictive Suite | Predictive Suite integrates statistics and predictive data mining methods (neural networks, genetic algorithms, fuzzy analysis) with a rich graphical interface. Predictive Engines supports the integration of predictive models with production systems. | Generic capability with applications in marketing, sales, insurance, financial services, energy industries and manufacturing. | Medium to Large | Analyst |
PredixionSoftware | Excel-based modeling tools on the client side, with a server that can access most data sources, including big data (Hadoop, Greenplum, etc.). | Wide applicability, particularly in financial services, manufacturing and retail. | Medium to Large | Analyst User |
RapidMiner | Rapid-I provides an Enterprise Edition of one of the most widely used data mining modelling tools. RapidMiner is a very rich toolset supporting the complete model lifecycle. This is not an end-user tool; it requires considerable skill. The free version is very widely used - in fact possibly the most widely used analytics workbench. The graphical user interface is highly productive and the product supports most analytical techniques and data transformation tools. | Applicable to most data mining tasks. | Small to Large | Analyst |
RevolutionAnalytics | R is the most widely used, and arguably the most powerful, analysis software on the planet. Revolution R Enterprise is built on open source R and has been enhanced for performance, productivity (through visual tools), and integration with enterprise data sources - particularly Apache Hadoop for big data applications. Support and training services are bundled on top of the technology - something most organisations will require. A community edition of Revolution R is available for free. | A vast array of capabilities with all the R libraries. | Small to large. | Analyst |
Salford Systems | Salford Systems delivers a portfolio of products capable of traditional descriptive analytics (what has happened or is happening) and predictive analytics. The SPM Salford Predictive Modeler supports both traditional descriptive and predictive analytics. CART supports classification and the discovery of hidden relationships between attributes. MARS (Multivariate Adaptive Regression Splines) produces regression models and is seen as a complement to CART. | Broad capability with general applicability. | Medium to Large | Analyst |
SAS | One of the most experienced and best known suppliers in the analytics space. The product scope is broader than most and includes techniques such as Singular Spectrum Analysis - something that very few other suppliers provide. SAS has majored on a number of vertical (financial services, healthcare, insurance, retail etc) and horizontal solutions (supply chain, risk management, fraud, finance etc), providing a very rich mix of capabilities. | Both end user and analyst tools are provided and a large number of solutions for line of business and verticals. If it can't be done in SAS then it probably can't be done at all! | Small to large | User Analyst |
Statistica | StatSoft provides a very wide range of data mining and statistical analysis tools - pretty much everything you might ever need. It also provides a wide range of solutions to a variety of industries including banking, automotive, petrochemical, healthcare, insurance, telecoms and many others. | It is unlikely that Statistica would fall short on any valid requirement. Tools are provided for both analysts and end users, and solutions are available for most verticals. | Small to large. | User Analyst |