Monthly Archives: January 2013

The Rise of Data Products

by Sean Murphy I had the great opportunity to present at the kick-off event for the Mid-Maryland Data Science Meetup on “The Rise of Data Products”. Below is the talk captured in images and text. Update: You can also download the … Continue reading

Posted in Data Science MD, Events, Reviews | 2 Comments

D3.js Meta Tutorial

by Sean Murphy D3.js, the follow-up to Mike Bostock’s impressive and useful Protovis library, is a fantastic tool for building dynamic, web-based data visualizations consumable by the masses. However, as anyone not familiar with JavaScript, jQuery, SVG, CSS and/or HTML, … Continue reading

Posted in Tutorials, Visualization | 6 Comments

*THE* List of Data-Related Meetup Groups in the Greater Washington DC Area

Ever wondered what data-related meetups you might be missing? Well, wonder no more, because we have compiled (and will periodically update) a list complete with some interesting statistics for you. Please note that most of these meetups can also be … Continue reading

Posted in Community, Events | 1 Comment

Weekly Round-Up: Programmers vs. Politicians, DNA Data Storage, Josh Sullivan, and Billionaire Visualization

Welcome back to the round-up, an overview of the most interesting data science, statistics, and analytics articles of the past week. This week, we have 4 fascinating articles ranging in topics from storing data on DNA to visualizing the Forbes … Continue reading

Posted in Round-Ups

Getting Started with Recommendation Systems

Next Monday’s Data Science DC Meetup is on Recommendation Systems in the Real World, and features two outstanding local data scientists, Matt Bryan from WaPo Labs and Bryce Nyeggen from LivingSocial. To preview that event, we asked them to suggest … Continue reading

Posted in Data Science DC | 1 Comment

Immersive Data Visualization

There are two types of data visualization: discovering data relationships and imaging known relationships. As engineers, scientists, analysts, et cetera, we are very familiar with exploring a new data set, a new space, and how visualizing that space helps us become … Continue reading

Posted in Round-Ups, Visualization

A Better Mathematical Model of Viral Marketing

This is the second part of a four-part series of blog posts on viral marketing. In part 1, I discuss the faulty assumptions in the current models of viral marketing. In part 3, I show the weird dynamics of viral … Continue reading

Posted in Projects | 11 Comments

The DC Data Weekly: Education Data Contests and Sources

This week’s DC Data Weekly is short but sweet and is directly aligned with our upcoming Data Business DC Meetup next Wednesday! K-12 Education Data Challenge Apps4VA Open Competition Deadline – January 31, 2013! Apps4VA Open Competition challenges applicants … Continue reading

Posted in Community, Competitions, Data Innovation DC, GuestPost

Google Compute Engine vs Amazon EC2 Part 2: Synthetic CPU and Memory Benchmarks

 by Sean Murphy Testing Assumptions In the last article, I examined pricing and feature differentiation between Google Compute Engine and Amazon EC2 instance types. Now, it is time to see if the last article’s key assumptions, that Google Compute Engine Units are … Continue reading

Posted in Reviews | 4 Comments

Hackathons and DataDives

The Data Events DC calendar currently shows three Hackathons and DataDives over the next few months, and at least one other will be posted shortly. But what is a Hackathon or a DataDive? How are they different? And why would … Continue reading

Posted in Competitions, Events