NLP and the CIA World Factbook

NLP and the CIA World Factbook

The CIA World FActbook is a data behemoth well suited for Natural Language Processing experimentation. In this post we play with a K-means clustering algorithm to organize countries into their geographic region.

Hey Julia

Hey Julia

The forthcoming Julia programming language has scientists, number crunchers, and all kinds of data junkies excited over its promise to bring expressive and flexible syntax without compromising performance.

Deep Learning: Hospital Readmissions

Deep Learning: Hospital Readmissions

Readmissions are a thorny and expensive problem for hospitals, which is why there is a growing emphasis on identifying patients most likely to be readmitted before they are discharged. Does deep learning have a role to play in helping to i.d. patients most at risk for readmissions? Let’s look at the numbers…

Zeppelin: Data Analysis For the People

Zeppelin: Data Analysis For the People

As data becomes more prevalent across many industries, there is a greater need for tools that facilitate the democratization and collaboration of data analysis projects. One such application is Apache Zeppelin a web-based, open-source data analytics notebook.