Python Data Mining Resources

Python has been gaining interest in the data mining community because it is an open source, general-purpose programming language that also works well for web scripting. Below are some resources to kick-start data mining with Python:

The Python Tutorial (updated ver 2.7) – read this manual first!
Orange – an open source tool for data visualization, data analysis and data mining through visual programming or Python scripting.
Scrapy – a fast, high-level screen scraping and web crawling framework, written entirely in Python, used to crawl websites and extract structured data from web pages. Use it to collect raw data from websites for data mining purposes.
Data Mining in Python – a collection of libraries useful for machine learning and data mining, especially clustering and supervised learning.
Pattern – a web mining module for the Python programming language. It contains tools for data retrieval, text analysis and data visualization and comes with over 30 sample scripts.
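To make "extract structured data from web pages" concrete, here is a minimal sketch using only Python's standard library. It is not Scrapy's or Pattern's API; those tools automate crawling and extraction at a much larger scale. The names `LinkExtractor`, `extract_links` and `SAMPLE_HTML` are made up for this illustration, the sample markup and URLs are placeholders, and the code assumes Python 3 rather than the 2.7 release mentioned above.

```python
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collect (href, link text) pairs from anchor tags (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self.links = []     # finished (href, text) records
        self._href = None   # href of the <a> tag currently open, if any
        self._text = []     # text fragments seen inside that <a> tag

    def handle_starttag(self, tag, attrs):
        # Start a new record only for anchors; anchors without href stay ignored.
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Keep text only while we are inside an anchor that has an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        # Closing the anchor turns the collected pieces into one record.
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def extract_links(html):
    """Return a list of (href, text) tuples found in an HTML string."""
    parser = LinkExtractor()
    parser.feed(html)
    parser.close()
    return parser.links


# Placeholder page content; a real crawler would download this instead.
SAMPLE_HTML = """
<ul>
  <li><a href="http://example.com/orange">Orange</a></li>
  <li><a href="http://example.com/scrapy">Scrapy <b>framework</b></a></li>
</ul>
"""

for href, text in extract_links(SAMPLE_HTML):
    print(href, text)
```

The result is a list of plain tuples, which is the kind of raw, structured input that the libraries listed above can then cluster or classify.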


Data Analytics News July 13, 2011

Mzinga OmniSocial to Power World’s First Online Community for Big Data Analytics – Benzinga
Mzinga ® (http://www.mzinga.com), the leader in social intelligence solutions, services, and analytics for business, today announced that Teradata, the world’s largest company focused on big data analytics and data warehousing, has selected the Mzinga …

Experian Marketing Services Launches New Consumer Economic Barometer – Stockhouse
Experian Marketing Services, a leading provider of data, analytics and marketing technologies to help organizations target and engage their customers more effectively, today launched the Experian Consumer Expectation Index. Marketers now have access to …

The International ACM Knowledge Discovery and Data Mining Conference Brings … – San Francisco Chronicle (press release)
For the first time, the conference also includes an Industry Practice Expo, a brand new track of invited talks from leading experts, who have developed and deployed successful, large-scale predictive analytics and data mining systems in their …

More uses seen on the horizon for content and text analytics tools – SearchDataManagement.com
“Text and speech analytics will become ubiquitous,” Herschel predicted. Reading ‘the face of the customer’ through video analytics? Perhaps the most far-off type of unstructured data analytics is video and image analytics. Images are hard enough for a …

Wharton Research Data Services Announces the Addition of the WRDS SEC … – SunHerald.com
PHILADELPHIA – The Wharton School of the University of Pennsylvania announced today that Wharton Research Data Services (WRDS) has created the WRDS SEC Analytics Suite, allowing for researchers to effortlessly retrieve and export customized subsets of …

SAS’ ‘Big Analytics’ Tackles Big Data, Speeds Problem Solving – Insurance News Net (press release)
“We rely on SAS to handle Expedia’s big data. SAS predictive analytics and data mining analyze nearly 200 terabytes of customer and clickstream data in our data warehouse. Customers need a reason to do business with Expedia. Analytic insights help us …

Zettaset raises $3M for large-scale data analytics – VatorNews
Founded in 2007, the business intelligence company focuses on providing large-scale data analytics services for the enterprise via the open-source platform Hadoop. The company’s new name, Zettaset, is a play on Zettabyte, which is one million petabytes …
Related: Zettaset raises $3M for the consumerization of big data (GigaOm); GOTO Metrics Inks $3M (Private Equity Hub, press release); GOTO Metrics raises $3M, relaunches as Zettaset (San Jose Business Journal)

Public sector can make big savings through smart data usage – The Guardian
In their latest report, the business advisory firm outlines the need for the public sector to tighten up its use of data to drive down organisational costs and improve service efficiency. Deloitte found that data analytics – in particular predictive …
Related: Data analytics ‘can bring down costs’ (Public Finance)

Data Analytics The Next Big Thing For India Inc: Pramod Haque – VC Circle
BY TEAM VCC The next set of innovation for India Inc. will revolve around the ‘Big Data’ or data analytics, according to Pramod Haque, managing partner at the US-based venture capital firm Norwest Venture Partners. Speaking at the recently concluded …

Sybase & Gartner: democracy needed in data warehouse analytics – ComputerWeekly.com (blog)
So then, we need to bring more democracy to data warehouse analytics I suppose. Also in the mix with this news announcement is the use of Sybase IQ PlexQ technology. This offering dynamically balances query workloads across nodes on the grid for …
Related: Sybase announces availability of Sybase IQ 15.3 with PlexQ technology (istockAnalyst.com, press release)


R Data Mining Resources

Data mining with R has been gaining popularity among data miners and data analysts around the globe. Rexer’s Annual Data Miner Survey in 2010 reported that R has become the data mining tool used by more data miners (43%) than any other tool. According to Wikipedia, R is a programming language and software environment for statistical computing and graphics. R provides a wide variety of statistical and graphical techniques, including linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, clustering, and others. I have therefore compiled the top resources on R data mining for your reference:

  1. R Project for Statistical Computing – the official website of the open source R project. Here you can get the latest release of the R source code, the manuals and recent bug reports.
  2. R Books Website – a list of recent books that relate to R and may be useful to the R user community. You may also like to read the Data Mining with R book (a data mining bestseller at Amazon.com).
  3. R in Wikipedia – here you can read basic information and examples of R programming, including a list of GUIs for R and some references.
  4. Rattle: A GUI for Data Mining using R – a simple and logical graphical user interface based on Gnome that can be used by itself to deliver data mining projects. Rattle runs under GNU/Linux, Macintosh OS/X, and MS/Windows.
  5. R reference card for data mining – a collection of R packages and functions for data mining.
  6. R Bloggers – a central hub of news and tutorials contributed by 185 R bloggers.
  7. R Video Tutorials – a series of R for Statistical Programming screencasts that show you how to use R for text mining (some of the video links are missing).
  8. Reasons to learn R? – a YouTube video describing why students should learn the R programming language.
  9. Programming R – online R programming resources, from beginner to advanced level.
  10. R Programming Wikibook – a place where anyone can share their tricks and knowledge of R.
