Category Archives: Resources

Announcing the Publication of Practical Data Science Cookbook

Four of DC2′s board members have published a new book! Tony Ojeda, Sean Murphy, Benjamin Bengfort, and Abhijit Dasgupta are proud to announce the arrival of Practical Data Science Cookbook. Practical Data Science Cookbook is perfect for those who want to learn … Continue reading

Posted in Announcements, Languages, Methods, Press Releases, Python, R, Resources | Tagged , , , , , , , , | Leave a comment

Announcing Discussion Lists! First up: Deep Learning

Data Community DC is pleased to announce a new service to the area data community: topic-specific discussion lists! In this way we hope to extend the successes of our Meetups and workshops by providing a way for groups of local … Continue reading

Posted in Announcements, Community, GuestPost, Resources | Tagged , , | Leave a comment

High-Performance Computing in R Workshop

Data Community DC and District Data Labs are excited to be hosting a High-Performance Computing with R workshop on June 21st, 2014 taught by Yale professor and R package author Jay Emerson. If you’re interested in learning about high-performance computing including concepts such … Continue reading

Posted in Events, R, Resources, Tutorials | Tagged , , , | 1 Comment

Facility Location Analysis Resources Incorporating Travel Time

This is a guest blog post by Alan Briggs. Alan is a operations researcher and data scientist at Elder Research. Alan and Harlan Harris (DC2 President and Data Science DC co-organizer) have co-presented a project with location analysis and Meetup … Continue reading

Posted in GuestPost, Meetup, Methods, Resources, Statistical Programming DC | Tagged , , | Leave a comment

Flask Mega Meta Tutorial for Data Scientists

Introduction Data science isn’t all statistical modeling, machine learning, and data frames. Eventually, your hard work pays off and you need to give back the data and the results of your analysis; those blinding insights that you and your team … Continue reading

Posted in Languages, Methods, Python, Resources | Tagged , , | 3 Comments

Ensemble Learning Reading List

Tuesday’s Data Science DC Meetup features GMU graduate student Jay Hyer‘s introduction of Ensemble Learning, a core set of Machine Learning techniques. Here are Jay’s suggestions for readings and resources related to the topic. Attend the Meetup, and follow Jay … Continue reading

Posted in Data Science DC, GuestPost, Meetup, Resources | Tagged , , , | 2 Comments

Instructions for deploying an Elasticsearch Cluster with Titan

Elasticsearch is an open source distributed real-time search engine for the cloud. It allows you to deploy a scalable, auto-discovered cluster of nodes, and as search capacity grows, you simple need to add more nodes and the cluster will reorganize … Continue reading

Posted in Resources, Tutorials | Tagged , , , | 2 Comments

General Assembly & DC2 Scholarship

The DC2 mission statement emphasises that “Data Community DC is an organization committed to connecting and promoting the work of data professionals…”, ultimately we see DC2 becoming a hub for data scientists interested in exploring new material, advancing their skills, … Continue reading

Posted in Announcements, Community, Data Science DC, Data Visualization DC, Resources, Sponsored, UX/UI, Visualization | Tagged , , , , , , , | Leave a comment

Python for Data Analysis: The Landscape of Tutorials

Python has been one of the premier general scripting languages, and a major web development language. Numerical and data analysis and scientific programming developed through the packages Numpy and Scipy, which, along with the visualization package Matplotlib formed the basis … Continue reading

Posted in Python, Resources, Tutorials | Tagged , , | 3 Comments

Data Visualization: Sweave

So you’re a data scientist (statistician, physicist, data miner, machine learning expert, AI guy, etc.) and you have the envious challenge of communicating your ideas and your work to people who have not followed you down your rabbit hole.  Typically … Continue reading

Posted in Commentary, Data Visualization DC, DataBlog, R, Resources, Reviews, Visualization | 1 Comment