Browsed by tag: large data corpora

Why Nonadditive Entropy Is Important for Big Data Corpora Combinations

Non-Additive Entropy: A Crucial Predictive Analysis Measure for Data Mining in Multiple Large Data Corpora

Statistical mechanics has an important role to play in big data analytics. Until now, there has been almost no understanding of how statistical mechanics provides both practical value and a theoretical framework for data analysis and even predictive intelligence (sometimes called predictive analysis). This blog post focuses on a related, and crucially important, issue: how can we determine the value of combining…

Chapter 2 Review, Continued, Part 2 — "Automatic Discovery of Similar Words"

(Direct continuation of yesterday’s post on Senellart & Blondel’s “Automatic Discovery of Similar Words” in Survey of Text Mining II. The references they cite, which I discuss in this post, are listed at the end of the post.) In Chapter 2’s review of previous methods and associated literature, Senellart & Blondel start with the banal and get progressively more interesting. The one thing I found interesting in the first model that Senellart and Blondel discussed was that the model was…

"Automatic Discovery of Similar Words" – Chapter 2 in Survey of Text Mining II

This post begins a review of “Automatic Discovery of Similar Words,” by Pierre Senellart and Vincent D. Blondel, published as Chapter 2 in Berry and Castellanos’ Survey of Text Mining II. This is an excellent and useful chapter, in that it: 1) addresses the broad issue of computational methods for discovering “similar words” (including synonyms, near-synonyms, and thesauri-generating techniques) from large data corpora, 2) illustrates the different leading mathematical methods, giving an excellent overview of the state of the art, and 3) competently discusses how different methods…
