If you’re an early startup, you’re probably just getting started with your analytics and figuring out how people are using your product. For this purpose, an out-of-the-box tool like Google Analytics, Mixpanel, or Amplitude should be enough to handle your needs. Building out infrastructure to dump your data into AmazonS3, Postgres, or BigQuery may seem like overkill.
But when you take a step back and look one or two years ahead, you’ll discover it will be massively valuable for you to have the optionality to do any analysis you could possibly imagine without needing to create hacky exports from your analytics tools. This requires you to own and manage your own database.
In this lesson, we’ll share why you might want to own your data and your options for data storage.
Now versus later
Let’s start with now. Today, you are probably asking fairly simple questions with your data: Where do users get confused and drop off in my funnel? What’s my overall retention rate? What are my top sources of referral sign ups? These questions are easily answered in most analytics tools, and they’ll get you pretty far. Google Analytics, Mixpanel, and Amplitude let you easily make funnel reports, graphs, and dashboards. And you can make decisions as quickly as setting up the tool. Aside from implementing the tracking, there is no further engineering required.
But at a certain point, it will be hard for you to get to the bottom of a question or an unusual signal in your data. When you start to hit product-market fit and gain momentum, you’ll have more nuanced questions: How many users had multiple referral sources, and how do I weigh the value of each channel? What happens when a user starts their journey on mobile and finishes on web? Unfortunately, it may be difficult or even impossible for some tools to provide that depth and specificity in their reporting. If you don’t control your analytics data, you’ll be left with a choice: answer with the tools you have or don’t answer at all.
It’s at this point where owning your data gives you the leg up. Owning your data is valuable for two reasons:
- Storing data in your own data warehouse in a normalized format (not JSON blobs shoved into individual cells) and layering on BI tools or command-line SQL will give you the most flexible access to answer your business questions.
- Owning your data in a database you manage also future-proofs your marketing tech stack choices. You can always choose to move to a different tool because you can import your historical data into a new tool.