Blog | Civis Analytics

Civis Blog

Did the Super Bowl 50 ads work? Data science provides an answer.

Did the Super Bowl 50 ads work? Data science provides an answer.

by Civis Analytics

Advertising during the Super Bowl is a huge investment. Brands spend upwards of $5 million on a 30-second spot because it’s the biggest TV advertising platform of the year. It’s an opportunity to reach over 100 million viewers with messages about brands, products and services – but what kind of return on investment (ROI) do these ads generate? How can...

Read More 

Open Source at Civis Analytics

Open Source at Civis Analytics

by Civis Analytics

Here at Civis Analytics, we love open source. We use Ruby on Rails, AngularJS, Docker, and Go (to name a few projects), and we’re happy to contribute back to the community and release our own open source projects. We’re pleased to announce our new open source page where we will publish our open source projects and list our policies. We...

Read More 

RubyAudit: Tirelessly auditing Ruby and RubyGems so you don’t have to

RubyAudit: Tirelessly auditing Ruby and RubyGems so you don’t have to

by Jeff C.

Security is hard. You’re busy writing code, but you also want to keep your application secure, so you’re doing double-duty developing new features and keeping an eye on vulnerabilities. You follow Hacker News and Reddit, but you know any good security strategy revolves around defense in depth, and you’re looking to add additional, automated tiers to help keep an eye...

Read More 

Data Science on State of the Union Addresses: Obama (2016) vs. Obama (2015) vs. … vs. George Washington (1790)

Data Science on State of the Union Addresses: Obama (2016) vs. Obama (2015) vs. … vs. George Washington (1790)

by Michael H.

Barack Obama recently gave his final State of the Union address, and since we’re interested in analyzing text data at Civis Analytics, I figured I ought to see if I could discover anything interesting. Rather than trying to understand the conversation on social media as we’ve done in previous work, I decided to take a somewhat longer view, comparing the...

Read More 

Connect the Civis Platform to Google Sheets: Let Your Drive Be Part of Your Data-Driven Culture

Connect the Civis Platform to Google Sheets: Let Your Drive Be Part of Your Data-Driven Culture

by Civis Analytics

Civis Analytics helps organizations across sectors use data science to improve outcomes. While working across multiple engagements and sectors, we’ve determined the most successful organizations invite their entire team to participate in building a data-driven culture by setting every employee’s sights on central metrics. Many of these successful organizations complement big data with Google Sheets, as they allow employees outside...

Read More 

Workflows in Python: Using Pipeline and GridSearchCV for More Compact and Comprehensive Code

Workflows in Python: Using Pipeline and GridSearchCV for More Compact and Comprehensive Code

by Katie M.

The last two posts in this series have been about getting a data science analysis quickly up and running, and then circling back to improve it or understand the patterns I find, for example, which algorithms are working best and why. The upshot was a better handle on my workflow, but I’m left with a lot of free parameters of...

Read More 

The Republican Primary So Far, in One GIF

The Republican Primary So Far, in One GIF

by Nick A.

Today, the New York Times featured some of our polling data and proprietary algorithm results and their implications for the Republican primary race ““ a mostly under-the-hood look at Donald Trump’s coalition. We wanted to give a bit of a different look at the same data at a local level, showing how support for the Republican candidates has changed over...

Read More 

Workflows in Python: Curating Features and Thinking Scientifically about Algorithms

Workflows in Python: Curating Features and Thinking Scientifically about Algorithms

by Katie M.

This is the second post in a series about end-to-end data analysis in Python using scikit-learn Pipeline and GridSearchCV. In the first post, I got my data formatted for machine learning by encoding string features as integers, and then used the data to build several different models. I got things running really fast, which is great, but at the cost...

Read More