Monthly Archives: May 2013

Data Visualization: New Shiny Packages & Products

Over the past few weeks and months we’ve been exploring the new R web application framework Shiny, how we can develop in it, what its potential is, and what’s new.  As expected, web apps with Shiny are getting very sophisticated, and … Continue reading

Posted in Data Visualization DC, DataBlog, R, Resources, Reviews, Shiny, Visualization | Leave a comment

Stepping up to Big Data with R and Python: A Mind Map of All the Packages You Will Ever Need

On May 8, we kicked off the transformation of R Users DC to Statistical Programming DC   (SPDC) with a meetup at iStrategyLabs in Dupont Circle. The meetup, titled “Stepping up to big data with R and Python,” was an … Continue reading

Posted in Community, Python, R, Resources, Statistical Programming DC | Tagged , , , | 9 Comments

Weekly Round-Up: WibiData, Big Data Trends, Analytics Processes, and Human Trafficking

Welcome back to the round-up, an overview of the most interesting data science, statistics, and analytics articles of the past week. This week, we have 4 fascinating articles ranging in topics from Big Data trends to using data to fight … Continue reading

Posted in Round-Ups | Tagged , , , | Leave a comment

Data Visualization: Exploring Biodiversity

When you have a few hundred years worth of data on biological records, as the Smithsonian does, from journals to preserved specimens to field notes to sensor data, even the most diligently kept records don’t perfectly align over the years, … Continue reading

Posted in Data Visualization DC, DataBlog, Projects, Uncategorized, Visualization | Leave a comment

National Day of Civic Hacking Events in DC, MD, and VA

We’ve written before about Hackathons and Data Dives. Now, I’d like to bring your attention to a coordinated set of events happening the weekend of June 1st, called the National Day of Civic Hacking. The official web site describes it … Continue reading

Posted in Community, Competitions, Events | Tagged , , , | Leave a comment

PyAutoDiff: automatic differentiation for NumPy

We are excited to have a guest post discussing a new tool that is freely available for the Python community. Welcome, Jeremiah Lowin, the Chief Scientist of the Lowin Data Company, to the growing pool of Data Community DC bloggers. … Continue reading

Posted in Announcements, Python | Tagged , , , | Leave a comment

Data Community DC Video Series Kicks Off: Dr. Jesse English Talks NLP and Text Processing

We are excited to announce the first in a new series of posts and a brand new initiative: Data Community DC Videos! We are going to film and publish online videos (and separate audio, resources permitting) as many talks from … Continue reading

Posted in Community, Data Science MD, Python, Tutorials, Videos | Tagged , , , | Leave a comment

Evidence from Google IO: Recommendation Engines are not MVPs

My co-editor’s earlier post today about recommendation engines is simply spot on and I wanted to add not only my strong agreement but also some more anecdotal support for her conclusions. Google IO 2013, which concluded last week, was filled … Continue reading

Posted in Micro, Rant | Leave a comment

Why You Should Not Build a Recommendation Engine

Recommendation engines are arguably one of the trendiest uses of data science in startups today. How many new apps have you heard of that claim to “learn your tastes”? However, recommendations engines are widely misunderstood both in terms of what … Continue reading

Posted in Commentary, Data Science DC, Methods, Rant | Tagged , , , | 11 Comments

Weekly Round-Up: Google’s Quantum Computer, Data Science vs. Statistics & BI, Business Computing, and Detecting Terrorism Networks

Welcome back to the round-up, an overview of the most interesting data science, statistics, and analytics articles of the past week. This week, we have 4 fascinating articles ranging in topics from Google’s new quantum computer to detecting terrorist networks. … Continue reading

Posted in Round-Ups | Tagged , , , | Leave a comment