Author Archives: Sean Murphy

About Sean Murphy

Sean Patrick Murphy, with degrees in math, electrical engineering, and biomedical engineering and an MBA from Oxford, has served as a senior scientist at Johns Hopkins University for over a decade, advises several startups, and provides learning analytics consulting for EverFi. Previously, he served as the Chief Data Scientist at a series A funded health care analytics firm, and the Director of Research at a boutique graduate educational company. He has also cofounded a big data startup and Data Community DC, a 2,000 member organization of data professionals. Find him on LinkedIn, Twitter, and .

DIDC Lean Data Product Development with the US Census Bureau – Debrief and Video

Thank you I want to thank everyone for attending DIDC’s May Meetup event, Lean Data Product Development with the US Census Bureau. This was our first attempt at helping bring potential data product needs to our audience and, based on audience … Continue reading

Posted in Data Innovation DC, Events | Tagged , , , | Leave a comment

A New Type of Meet Up Event?

Come join us the day after Memorial day for a new type of Meet Up. In the past, Data Innovation DC and Data Community DC have brought in fascinating speakers discussing data products and services that have already been built … Continue reading

Posted in Announcements, Community, Data Innovation DC | Tagged , , | Leave a comment

The Pragmatic Hackathon – Lean Customer Development for Data Products with the US Census Bureau

Interested in starting a company? It is summertime, the time for sequels. Our first event with the US Census Bureau was such a success that we are having a follow up event as part of the National Day of Civic … Continue reading

Posted in Announcements, Community, Data Innovation DC | Tagged , , , | Leave a comment

The Evils of Git

It would appear to many that Git and, as a result, GitHub, have taken over the world of code versioning. Having used git substantially more this past year, I can say that I am not blown away by all aspects … Continue reading

Posted in Commentary | Tagged , | Leave a comment

Former Obama For America and Living Social Data Scientists Show Off Their Startups – Data Innovation DC – Next Week!

Welcome Back! As a few people have mentioned, DIDC has been missing in action for January and February and, for that, we must apologize. We had an amazing sequence of events planned for the last two months that fell through … Continue reading

Posted in Announcements, Community, Data Innovation DC | Tagged , , | Leave a comment

The Fall of the P-Value

We at Data Community DC wanted to highlight a very interesting and relevant article for data practitioners published over at Nature.com. For most people, P-values are the “gold standard” by which the validity of scientific results are measured. However, mounting … Continue reading

Posted in Community | Tagged , | Leave a comment

Flask Mega Meta Tutorial for Data Scientists

Introduction Data science isn’t all statistical modeling, machine learning, and data frames. Eventually, your hard work pays off and you need to give back the data and the results of your analysis; those blinding insights that you and your team … Continue reading

Posted in Languages, Methods, Python, Resources | Tagged , , | 2 Comments

Expanding the Online Presence of Data Community DC – W3DC’s Strategic Plan for 2014

by Sean Murphy & Benjamin Bengfort W3DC handles online and technological aspects of Data Community DC. Its primary scope is the web domain at http://datacommunitydc.org as well as content and applications. Because of this, its natural responsibilities fall into several … Continue reading

Posted in Community, DataBlog | Tagged , , , | Leave a comment

A Tutorial for Deploying a Django Application that Uses Numpy and Scipy to Google Compute Engine Using Apache2 and modwsgi

by Sean Patrick Murphy Introduction This longer-than-initially planned article walks one through the process of deploying a non-standard Django application on a virtual instance provisioned not from Amazon Web Services but from Google Compute Engine. This means we will be … Continue reading

Posted in Commentary, Methods, Python | Tagged , , , , , | 1 Comment

Is Statistics the Least Important Part of Data Science?

There is a fascinating discussion occurring on Andrew Gelman’s blog that some of our Data Community DC member’s might want to chime in on … or discuss right here on our blog. There’s so much that goes on with data … Continue reading

Posted in DataBlog | Leave a comment