Author Archives: Benjamin Bengfort

About Benjamin Bengfort

Benjamin is a data scientist with a passion for massive machine learning involving gigantic natural language corpora, and has been leveraging that passion to develop a keen understanding of recommendation algorithms at Cobrain in Bethesda, MD where he serves as the Chief Data Scientist. With a professional background in military and intelligence, and an academic background in economics and computer science, he brings a unique set of skills and insights to his work. Ben believes that data is a currency that can pave the way to discovering insights and solve complex problems. He is also currently pursuing a PhD in Computer Science at the University of Maryland.

Natural Language Processing DC Discussion List

Data Community DC is pleased to announce a new service to the area data community: topic-specific discussion lists! In this way we hope to extend the successes of our Meetups and workshops by providing a way for groups of local people … Continue reading

Posted in Announcements | Tagged , , | Leave a comment

Confire: A new Python library

Announcing the release of a new open source library: Confire is a simple but powerful configuration scheme that builds on the configuration parsers of Scapy, elasticsearch, Django and others. The basic scheme is to have a configuration search path that … Continue reading

Posted in Announcements, Python | Tagged , , , , , | 3 Comments

Board Meeting Notes for January 2014

The Data Community DC Board Data Community DC (DC2) is a member driven organization that provides a truly open and welcoming community of people who produce, consume, analyze, and work with data. The organization is a coalition of several meetups, … Continue reading

Posted in Community, Management | Tagged | Leave a comment

Board Meeting Notes for December 2013

Board Meeting Notes for December 2013 The Data Community DC Board Data Community DC (DC2) is a member driven organization that provides a truly open and welcoming community of people who produce, consume, analyze, and work with data. The organization … Continue reading

Posted in Community, Management | Tagged , | Leave a comment

Ranking DC Software Companies for Fun and Employment

Are you a software engineer or data scientist in the DC area looking for a new project to work on? How do you select a company whose culture will fit your own? How do you look for the type of … Continue reading

Posted in Commentary, Community | Leave a comment

Board Meeting Notes for November 2013

The Data Community DC Board Data Community DC (DC2) is a member driven organization that provides a truly open and welcoming community of people who produce, consume, analyze, and work with data. The organization is a coalition of several meetups, … Continue reading

Posted in Community, Management | Tagged , | 1 Comment

Instructions for deploying an Elasticsearch Cluster with Titan

Elasticsearch is an open source distributed real-time search engine for the cloud. It allows you to deploy a scalable, auto-discovered cluster of nodes, and as search capacity grows, you simple need to add more nodes and the cluster will reorganize … Continue reading

Posted in Resources, Tutorials | Tagged , , , | 2 Comments

Getting Ready to Teach the Elephant to Read: A Strata + Hadoop World 2013 Tutorial

We (Ben Bengfort and Sean Murphy) are very excited to be holding the Teaching the Elephant to Read tutorial at the sold out Strata + Hadoop World 2013 on Monday, the 28th of October. We will be discussing and using … Continue reading

Posted in Announcements, Community, Text Analytics | Tagged , , , , , , , | 1 Comment

Entity Resolution for Big Data

A Summary of the KDD 2013 Tutorial Taught by Dr. Lise Getoor and Dr. Ashwin Machanavajjhala Entity Resolution is becoming an important discipline in Computer Science and in Big Data, especially with the recent release of Google’s Knowledge Graph and the open Freebase … Continue reading

Posted in Events, GuestPost, Reviews, Tutorials | Tagged , , , | 8 Comments

The State of Recommender Technology

Reblogged with permission from Cobrain. So let’s start with the big idea that is the reason that we are all here: recommendation engines. If you are reading this, you have probably already overcome the mental hurdle of the massive design … Continue reading

Posted in GuestPost, Round-Ups | Tagged , , , | 4 Comments