<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-5491519805308268501</id><updated>2011-11-27T23:44:40.913Z</updated><category term='datamining'/><category term='text mining'/><category term='lexicons'/><category term='dataset'/><category term='machine learning'/><category term='sentiment analysis'/><category term='information retrieval'/><category term='feature selection'/><category term='SVM'/><title type='text'>Knowledge Discovery and Opinion Mining</title><subtitle type='html'>Explorations on current issues in opinion mining, knowledge discovery, tools and techniques.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>11</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-7992384270220495009</id><published>2011-09-25T18:00:00.001+01:00</published><updated>2011-09-25T23:19:51.375+01:00</updated><title type='text'>Sentiment Classification at RCOMM 2011</title><summary type='text'>


Earlier this year I gave a presentation on Sentiment Classification at the 2011 RapidMiner User Conference in Dublin. I have posted the slides on Slideshare.


RCOMM 2011 - Sentiment Classification







View more presentations from bohanairl.

There is an extended experiment based on what has been discussed in this blog, but now running on RapidMiner 5. The original word vector model is </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/7992384270220495009/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2011/09/sentiment-classification-at-rcomm-2011.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/7992384270220495009'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/7992384270220495009'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2011/09/sentiment-classification-at-rcomm-2011.html' title='Sentiment Classification at RCOMM 2011'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-4431358572330544105</id><published>2011-02-02T21:39:00.009Z</published><updated>2011-02-02T22:35:46.088Z</updated><category scheme='http://www.blogger.com/atom/ns#' term='sentiment analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='lexicons'/><title type='text'>Sentiment Classification and Opinion Lexicons</title><summary type='text'>Lexicons are a big part of my current research in opinion mining. Aside from the potential of helping supervised learning methods, they can be applied to unsupervised techniques - an appealing idea for research whose goal is domain independence. An opinion lexicon is a database that associates terms with opinion information - normally in the form of a numeric score indicating a term's positive or</summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/4431358572330544105/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2011/02/sentiment-classification-and-opinion.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/4431358572330544105'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/4431358572330544105'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2011/02/sentiment-classification-and-opinion.html' title='Sentiment Classification and Opinion Lexicons'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_iwI4DT8tGxQ/TUnVvdxKAqI/AAAAAAAAAVs/OvS-euDiWpg/s72-c/swnscoring.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-638843308798096914</id><published>2009-09-08T21:36:00.017+01:00</published><updated>2009-09-08T23:09:15.897+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sentiment analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='datamining'/><category scheme='http://www.blogger.com/atom/ns#' term='feature selection'/><title type='text'>Parameter Testing - Letting RapidMiner Do The Hard Work</title><summary type='text'>In a previous post we have discussed an example of how to perform text classification in RapidMiner, and we used a data set of film reviews against several word vector schemes to classify documents according to their overall positive or negative sentiment. In this tutorial we show how to look for better results by using RapidMiner's parameter testing feature and evaluate the effects of feature </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/638843308798096914/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2009/09/parameter-testing-letting-rapidminer-do.html#comment-form' title='12 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/638843308798096914'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/638843308798096914'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2009/09/parameter-testing-letting-rapidminer-do.html' title='Parameter Testing - Letting RapidMiner Do The Hard Work'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_iwI4DT8tGxQ/SqbGqyJ0BSI/AAAAAAAAAN8/nG7rlOQfsKM/s72-c/image1.JPG' height='72' width='72'/><thr:total>12</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-5883164778237011171</id><published>2009-04-19T17:33:00.003+01:00</published><updated>2009-04-19T17:39:47.933+01:00</updated><title type='text'>CFP for 1st International Workshop on Topic and Opinion Mining</title><summary type='text'>Got that from From KDNuggets...A workshop dedicated to topic and opinion mining being held in Hong Kong later this year as part of ACM's CIKM '09, and has recently issued a CFP.Key dates as per the main page:Individual workshop papers due: July 20, 2009Notification of Acceptance: August 10, 2009Camera ready: August 15, 2009 *(hard deadline for publication in proceedings)*Early registration </summary><link rel='related' href='http://sites.google.com/site/tsa2009workshop/' title='CFP for 1st International Workshop on Topic and Opinion Mining'/><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/5883164778237011171/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2009/04/cfp-for-1st-international-workshop-on.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/5883164778237011171'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/5883164778237011171'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2009/04/cfp-for-1st-international-workshop-on.html' title='CFP for 1st International Workshop on Topic and Opinion Mining'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-341720792096450104</id><published>2008-07-11T11:16:00.004+01:00</published><updated>2008-07-11T14:58:25.939+01:00</updated><title type='text'>Digital Memories - A Google TechTalk</title><summary type='text'>In this video, Steve Whittaker from Sheffield University talks about recent research in the area of Digital Memories - storing information about personal events spanning an entire lifetime in digital format, with obvious opportunities for mining all kinds of interesting patterns. It can also open up a particular branch of mining applications specifically geared at providing personalized, easy to </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/341720792096450104/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2008/07/digital-memories-google-techtalk.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/341720792096450104'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/341720792096450104'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2008/07/digital-memories-google-techtalk.html' title='Digital Memories - A Google TechTalk'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-8610931882821657360</id><published>2008-06-23T22:01:00.019+01:00</published><updated>2008-07-01T23:29:57.496+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sentiment analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><category scheme='http://www.blogger.com/atom/ns#' term='SVM'/><title type='text'>Opinion Mining with RapidMiner - A Quick Experiment</title><summary type='text'>In this post I'll use the polarity data set from Bo Pang / Lilian Lee to perform a text classification experiment on RapidMiner. RapidMiner (formerly Yale) is a open source data mining and knowledge discovery tool written in Java, incorporating most well known mining algorithms for classification, clustering and regression; it also contains plugins for specialized tasks such as text mining and </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/8610931882821657360/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2008/06/opinion-mining-with-rapidminer-quick.html#comment-form' title='11 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/8610931882821657360'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/8610931882821657360'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2008/06/opinion-mining-with-rapidminer-quick.html' title='Opinion Mining with RapidMiner - A Quick Experiment'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>11</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-3324872634588376863</id><published>2008-05-10T22:19:00.008+01:00</published><updated>2008-07-01T23:33:20.268+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dataset'/><title type='text'>Experiment Databases</title><summary type='text'>A repository with results on data mining experimentsThe Experiment Databases tool makes empirical results from AI data mining experiments more accessible and reusable. The site hosts a query engine in SQL format, retrieving results from mining experiments recorded into their repository. These can then be easily studied, and compared to other similar experiments - something that could otherwise </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/3324872634588376863/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2008/05/experiment-databases-repository-for.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/3324872634588376863'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/3324872634588376863'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2008/05/experiment-databases-repository-for.html' title='Experiment Databases'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-7978216568669437869</id><published>2008-05-10T21:44:00.007+01:00</published><updated>2008-06-05T10:52:37.067+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><category scheme='http://www.blogger.com/atom/ns#' term='dataset'/><title type='text'>Quick Survey on Text Data Sets</title><summary type='text'>Here is a list of text data sets available on the web, with some comments on their content.1. UCI's Machine Learning Repository - A huge list of data sets on a variety of topics and formats in a searchable interface. There are 8 textual data sets available, including the popular Reuters  21578 and 20 NewsGroups data sets for text classification.2. TechTC Data Set is a repository of text documents</summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/7978216568669437869/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2008/05/quick-survey-on-text-data-sets.html#comment-form' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/7978216568669437869'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/7978216568669437869'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2008/05/quick-survey-on-text-data-sets.html' title='Quick Survey on Text Data Sets'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-6992750134336012233</id><published>2007-11-25T21:56:00.000Z</published><updated>2007-11-26T00:01:34.698Z</updated><title type='text'>Visual Complexity</title><summary type='text'>Stunning array of projects in knowledge visualization, many of them exploring techniques for visualizing textual data and its relations to other data sources. A follow on from many mining exercises, it opens up far wider choices for humans to relate to the original raw content.</summary><link rel='related' href='http://www.visualcomplexity.com/vc/' title='Visual Complexity'/><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/6992750134336012233/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2007/11/visual-complexity.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/6992750134336012233'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/6992750134336012233'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2007/11/visual-complexity.html' title='Visual Complexity'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_iwI4DT8tGxQ/R0nvwvIvFVI/AAAAAAAAADU/8ZFAaL8VLzE/s72-c/499_thumb.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-5248742889951239372</id><published>2007-11-25T18:57:00.000Z</published><updated>2007-11-28T23:49:56.204Z</updated><title type='text'>A Shortlist of Topics</title><summary type='text'>After much reading and pondering, here's a shortlist of potential topics that nicely relate to text mining and knowledge management (Some references are missing):1- Investigate the problem of quantification (Forman, HP Labs) in text classification on a specific domain. Useful in estimating positive cases, concept drift etc.2- Improving classification performance with features from discourse </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/5248742889951239372/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2007/11/shortlist-of-topics.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/5248742889951239372'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/5248742889951239372'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2007/11/shortlist-of-topics.html' title='A Shortlist of Topics'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-5491519805308268501.post-5505126474921775119</id><published>2007-10-14T16:51:00.000+01:00</published><updated>2007-10-14T21:35:06.054+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><category scheme='http://www.blogger.com/atom/ns#' term='information retrieval'/><category scheme='http://www.blogger.com/atom/ns#' term='machine learning'/><title type='text'>Text Mining Video Lectures</title><summary type='text'>Taken from Ljubliana's Summer School on Semantic Web '05. More lectures available on http://videolectures.netA lecture on applications of document summarization for generating semantic information in the form of graphs representing a given relationship on the text set.Learning Semantic Sub-graphs for Document Summarization, Marko GrobelnikInformation Extraction LectureInformation extraction, </summary><link rel='replies' type='application/atom+xml' href='http://kmandcomputing.blogspot.com/feeds/5505126474921775119/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://kmandcomputing.blogspot.com/2007/10/learning-semantic-sub-graphs-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/5505126474921775119'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/5491519805308268501/posts/default/5505126474921775119'/><link rel='alternate' type='text/html' href='http://kmandcomputing.blogspot.com/2007/10/learning-semantic-sub-graphs-for.html' title='Text Mining Video Lectures'/><author><name>Bruno Ohana, Dublin, Ireland</name><uri>http://www.blogger.com/profile/17485902015298972883</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_iwI4DT8tGxQ/RxJ8D1vccnI/AAAAAAAAACs/ka30mRhCSKg/s72-c/Text+Mining.jpeg' height='72' width='72'/><thr:total>0</thr:total></entry></feed>
