How about the information age? Just all of it. Maybe instead of information age, we could call it the saturation age, you know, because our brains are full to bursting. Or maybe just the overload age. Or how about the age of inundation? One thing is certain, anyways.
Some of us are drowning in data, most of us are oblivious, and some lucky few are surfing on it. And that got me wondering: Well, one approach might be to download this archive of past Jeopardy questions and plug them into your favorite spaced repetition system.
Okay, this one I have thought about. Check out the Enron corpus. It contains more than half a million emails from about users, mostly senior management of Enron, organized into folders. Either sell it to law enforcement or to corporate executives as the finest cover-your-ass email system.
Wondering what the internet really cares about? What does Reddit care about? Someone has scraped the top 2. Now you can figure it out with data! The use case was determining which domains are the most popular.
Speaking of cats, here are 10, annotated images of cats. This ought to come in handy whenever I get around to training a robot to exterminate all non-cat lifeforms. The earliest recorded chess match dates back to the 10th century, played between a historian from Baghdad and a student.
As a consequence, today, students of the game benefit from one of the richest data sets of any game or sport. You can download it here. I can imagine an app that calculates your chess fingerprint, letting you know which grandmaster your play is most similar to, or an analysis of how play style has changed over time. On the topic of games, for soccer fans, I recently came across this freely available data set of soccer games, players, teams, goals, and more.
I imagine that this could come in handy for coaches looking to get an edge over opponent teams and, more generally, for that cross-section between geeks and gamblers attempting to build analytic models to make better bets. Google has made all their Google Books n-gram data freely available. An n-gram is an n-word phrase, and the data set includes 1-grams through 5-grams.
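To make the n-gram idea concrete, here is a minimal Python sketch that pulls n-word phrases out of a string and counts them. The `ngrams` helper and the sample sentence are made up for illustration; this is not how Google built its data set, just the same idea at toy scale.

```python
from collections import Counter

def ngrams(text, n):
    """Return every run of n consecutive words in text (hypothetical helper)."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

# Made-up sample text, chosen so that some 3-grams repeat.
sample = "at the end of the day at the end of the line"

# Count how often each 3-gram appears, then show the most frequent ones.
counts = Counter(ngrams(sample, 3))
print(counts.most_common(3))
```

Phrases that keep showing up across a large body of text are good cliché candidates, which is roughly what a cliché detector would look for.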
Someone register the domain clichealert. Amazon has a number of freely available data sets (although I think you need to run your analysis on top of their cloud, AWS), including more than 2. The possibilities are endless, but here is an old idea I had: Buy these up and then resell them to people involved in SEO. Or you could, you know, try to build the next Google.
How well do minorities do on the computer science advanced placement exam? You can find out and tell me. A number of people have tried to build recommendation algorithms based on the data, including Kagglers and a team from Cornell. Or how about looking for a follow-the-leader effect? If one song goes viral with a unique style, do a bunch of copycats follow? Speaking of music data sets, last.
This would be good for clustering algorithms that automatically determine genre labels, or for recommender systems. When I think geeks, I think math and computer geeks, but there are many more. Terry Pratchett geeks (dated one!). Yelp has a freely available subset of their data, including restaurant rankings and reviews. This would enable you to build out a Yelp competitor without requiring an active user base; you could just mine Twitter for data!
The top 5 are school comparisons, unemployment, population, sales tax, and salaries. This list would be a good first step in researching what sort of data comparisons people actually care about. Some of my readers are, no doubt, evil geniuses. Others want to save the world.
All the things we take for granted, like that every person has one father. It would be a pain to insert those 10 million facts by hand and, at a fact a minute, take more than 19 years. Thankfully, Freebase has done part of the job for you, making more than 1. Maybe your plans are slightly less ambitious. In that case, check out the Mizar project, which has formalized more than definitions and theorems.
You long for someone you can connect with on a deeper level. Someone who can summarize any topic imaginable. In that case, you might want to feed your robot on Wikipedia data. While all of Wikipedia is freely available, DBpedia is an attempt to synthesize it into a more structured format.
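For the curious, the "more than 19 years" figure for entering 10 million facts at a fact a minute checks out, assuming you type around the clock with no breaks and count 365 days to a year (both assumptions are mine):

```python
# Back-of-the-envelope check, assuming nonstop work and a 365-day year.
FACTS = 10_000_000
MINUTES_PER_YEAR = 60 * 24 * 365  # 525,600 minutes

years = FACTS / MINUTES_PER_YEAR
print(f"{years:.1f} years")
```

Allow for sleep and weekends and the number only gets worse.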
Now, you get tired of mathematics and Wikipedia. But where to find the data for such a thing? You might start with downloading all 7. Actually, all the StackExchange data is freely available, so you could feed it more math information from both MathOverflow and the other math stackexchange.
Plus statistics from Cross Validated, and so on. Ever wanted to study true friendship? Well, now you can! Most revenue will come from sardine sales. Do left-leaning blogs more often link to other left-leaning blogs than right-leaning ones? And, thanks to permission from Lada Adamic, you can download her network of hyperlinks between weblogs on US politics, recorded in Or you could just read her paper.
You could find out by combining the dolphin data set mentioned earlier with Pablo M. What about s southern women or prisoners? How about fraternity members or HAM radio operators? All this and more can be figured out with these network data sets. Well, then maybe you want to develop some new-fangled trading algorithm and pick up like a trillion pennies from in front of the metaphorical steam-roller that is the market.
Market data which you can get here. The Open Product Data website aims to make barcode data available for every brand for free. Why, you ask, does the weather matter? The economic incentives for predicting the weather are absurd. When should you plant crops? Plan a big event?
Launch a space shuttle? Go deep sea fishing? I have a lot of respect for finance, mostly because of the crazy stuff they do. With weather data, it might know. A secret base on a planet outside of the solar system.
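If you want to poke at the left-versus-right linking question yourself, one simple starting point is to count what share of links stay on the same side. The sketch below uses a tiny made-up network; the blog names, labels, and the `same_side_fraction` helper are all hypothetical and have nothing to do with the actual contents or format of Adamic's data.

```python
# Hypothetical toy data, NOT taken from the real blog network.
leaning = {"a": "left", "b": "left", "c": "right", "d": "right"}

# Each pair is a hyperlink (source blog, destination blog).
links = [("a", "b"), ("b", "a"), ("a", "c"), ("c", "d"), ("d", "c")]

def same_side_fraction(links, leaning):
    """Fraction of links whose source and destination share a leaning."""
    same = sum(1 for src, dst in links if leaning[src] == leaning[dst])
    return same / len(links)

print(same_side_fraction(links, leaning))
```

A fraction well above what you would expect if links were placed at random suggests blogs mostly link to their own side. A fair comparison would also need to account for how many blogs each side has, which this sketch ignores.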
If you need a database of comprehensive book data, perhaps to build a competitor to Goodreads or an online digital library, the Open Library allows people to freely download their entire database. Who is the United States killing with drones? Mnemosyne is a virtual flash card program that takes advantage of spaced repetition to maximize learning.
The project has been collecting user data for years, and gwern has graciously agreed to freely host the data for a few months. Perhaps you could run some sort of unsupervised learning algorithm over it and try to discover heretofore unknown information about human memory. How much would it cost to hire Justin Bieber to play at your wedding?
The fine lads at Priceonomics have figured out how much it would cost to hire your favorite band. In researchers found that they could use data from Twitter to do just that: A paper by Clifford Winston and Fred Mannering reports that vehicle traffic costs the United States billion dollars each year. One way to do this would be to feed an algorithm historical traffic data and then use that to predict hotspots, which you would route people around.
Lots of that data is available on data. UC Irvine has you covered. But maybe you want to extend your spam-fighting service to text messages.
Still got you covered. There is a wealth of data sets available for R and all you have to do is install a package.