sql & data tools for pms
You need to be literate enough that you can do basic SQL. You're not expected to write production-grade queries — but you need to reduce the turnaround time, reduce the dependency on your data analyst.
The single biggest bottleneck in most product teams is not engineering capacity. It is the data request queue.
A PM has a hypothesis. They file a ticket to the analytics team. Three days later, the analyst sends back a CSV. The PM stares at it, realizes they asked the wrong question, files another ticket. Another three days. By the time they have an answer, the sprint is over and the decision was made on gut feel anyway.
This is the default state in most Indian product companies. And the PMs who break out of it all have one thing in common: they can answer their own questions. Not all questions. Not complex statistical modeling. But the 80% of questions that are just “show me rows from a table where this condition is true.”
That is what SQL gives you. Not a career differentiator — a survival skill.
How much SQL you actually need
Let me be direct: you need less than you think. I have watched PMs spend weeks on advanced SQL courses — window functions, recursive CTEs, query optimization — and come back to work still unable to answer the questions that matter.
Here is the list. If you can do these six things, you can answer 80% of your own data questions:
-
SELECT and WHERE — Pull specific columns, filter by conditions. “Show me all users from Bangalore who signed up last month.”
-
COUNT, SUM, AVG with GROUP BY — Aggregate data. “How many signups per city per week?”
-
JOIN two tables — Connect related data. “Match user signups with their first purchase.” Inner joins cover 90% of cases. Learn left joins when you need them.
-
Date filtering and date functions — Most PM questions are time-bound. “What happened last week vs. the week before?” If you cannot filter by date range, you cannot do trend analysis.
-
CASE WHEN — Categorize on the fly. “Bucket users into high/medium/low engagement tiers based on sessions per week.”
-
Subqueries — One query feeding into another. “Show me the retention rate for users who completed onboarding vs. those who didn’t.”
That is it. No window functions. No stored procedures. No database administration. You are not becoming a data engineer. You are becoming a PM who does not wait three days for a number.
Sprint retrospective at a fintech startup in Gurgaon. The team is reviewing why a feature launch had unclear results.
PM: “We launched the new referral flow three weeks ago. I still don't have the conversion numbers.”
Data Analyst: “It's in the queue. I've got five requests ahead of you from marketing and ops.”
PM: “Can you at least tell me the table name? I'll pull it myself.”
Data Analyst: “referral_events joined with user_signups. Filter on created_at after March 1st, and the event_type is 'referral_complete'.”
The PM opened the query editor, wrote a 6-line SQL query, and had the answer in ten minutes. The conversion rate was 4.2% — below the 8% threshold they'd set. The team pivoted the next sprint instead of waiting another week.
The PM who can pull their own data makes decisions in hours. The PM who cannot makes decisions in weeks — or not at all.
The SQL trap: when PMs go too far
Here is what I have seen go wrong. A PM learns SQL, gets excited, and starts spending half their day writing queries. They become the unofficial data analyst. Engineers start pinging them for numbers. The actual analytical work — interpreting the data, forming hypotheses, deciding what to build — gets crowded out by query writing.
You are not a data analyst. Your job is to ask the right questions and make decisions based on the answers. SQL is the tool that gets you answers faster. The moment you are spending more time writing queries than making product decisions, you have crossed the line.
Three rules to stay on the right side:
- If the query takes more than 15 minutes to write, hand it off. You are optimizing the wrong thing.
- If you need statistical rigor — significance testing, regression, causal inference — hand it off. SQL gives you counts and averages, not p-values.
- If the query touches production databases, never run it yourself. Use a read replica, a data warehouse, or a BI tool. One bad query on a production database can take down the product. This is not a theoretical risk — I have seen it happen.
Which analytics tools to learn (and in what order)
The tooling landscape is enormous and mostly irrelevant. Here is the sequence that gives you the fastest return:
Tier 1: Learn these first (week 1-2)
Google Analytics (GA4) — Free, available on almost every product. Flawed for product analytics (it was built for marketers, not PMs), but it is what you will encounter at most companies. Learn to set up events, build funnels, and create audience segments. Do not spend months mastering it — just get competent enough to pull your own traffic and conversion data.
Your company’s data warehouse query tool — This could be Redash, Metabase, Apache Superset, or a direct SQL client. Whatever your data team uses to query the warehouse, learn to use it. This is where your SQL skills become practical. Ask your data analyst to show you the schema, the key tables, and three example queries they run frequently. Copy those queries. Modify them. Build from there.
Tier 2: Learn these next (month 1-2)
Mixpanel or Amplitude or CleverTap — Product analytics platforms. They do what GA4 does poorly: track user behavior at the individual level, build cohort analyses, measure retention curves, and analyze funnels with user-level detail. In India, CleverTap has strong adoption — especially in consumer apps and fintech. If your company uses one of these, invest two weeks in learning it properly. The payoff is that most of your daily data questions become self-serve.
Tableau or Power BI or Looker — BI visualization tools. You do not need to become a dashboard designer. You need to know how to read existing dashboards, modify filters, and build a simple chart when you need to present data to leadership. Power BI is common in enterprises. Looker is common in startups that use Google Cloud. Tableau works everywhere but costs more.
Tier 3: Learn these when you need them
Hotjar or Microsoft Clarity — Session recording and heatmaps. Useful when you need to understand why users drop off at a specific step. Clarity is free and genuinely good. Use it when funnel data shows a problem but does not explain the cause.
Jupyter Notebooks (Python/Pandas) — For PMs who want to go deeper into data exploration. This is optional. If you find yourself doing the same complex analysis repeatedly and SQL is not enough, learning basic Pandas will save you time. But this is a “nice to have,” not a requirement.
Setting up your personal analytics stack
Here is what I recommend for any PM joining a new company. Do this in your first two weeks:
Day 1-3: Get access to everything. Do not wait for someone to offer you access. Request read-only access to the data warehouse, the product analytics tool, the BI dashboards, the error tracking system (Sentry or equivalent), and the APM tool (Datadog, New Relic, or equivalent). You will not use all of these immediately, but waiting for access later will cost you days.
Day 4-7: Learn the data schema. Sit with your data analyst for an hour. Ask them to walk you through the five most important tables: users, events, transactions (or whatever the core business action is), subscriptions, and sessions. Draw an entity-relationship diagram on paper. Understand how the tables connect. This is more valuable than any tool tutorial.
Day 8-14: Write your first ten queries. Start with the questions your team discusses in standups. “How many users signed up last week?” “What is our 7-day retention for the January cohort?” “How many users completed the new onboarding flow?” Write the SQL. Compare your answers with the official dashboards. If the numbers do not match, you just found either a bug in the dashboard or a gap in your understanding of the schema. Both are valuable.
The data conversation: working with analysts without annoying them
Data analysts are not your query-writing service. Treat them that way and they will deprioritize your requests — rightfully so.
What data analysts want from PMs:
-
Context, not just requirements. “I need the number of users who completed onboarding last month” is a request. “We’re trying to understand if the new onboarding flow improved activation — here’s our hypothesis, and here’s the specific metric we want to compare” is a collaboration. The second version gets you better answers because the analyst can challenge your assumptions before running the query.
-
Specificity about time ranges and cohorts. “Recent users” means nothing. “Users who signed up between January 1 and January 31, 2026, on the Android app” is something an analyst can work with.
-
Acknowledgment when their work drives a decision. When the analyst’s retention analysis convinced leadership to change the roadmap, say so publicly. Analysts are chronically under-recognized. A PM who gives credit builds an alliance that pays off every week.
What annoys data analysts about PMs:
- Filing vague requests and expecting the analyst to figure out what you actually want.
- Asking for “a quick pull” that turns out to be a complex analysis requiring new event instrumentation.
- Ignoring the data they provide because it does not support the PM’s preferred narrative.
- Re-asking the same question because the PM did not save or document the previous answer.
Product analytics vs. business intelligence: know the difference
PMs frequently conflate these two. They are different disciplines solving different problems.
Product analytics answers: What are users doing inside the product? Which features do they use? Where do they drop off? How does behavior differ between cohorts? Tools: Mixpanel, Amplitude, CleverTap, PostHog.
Business intelligence answers: How is the business performing? What are revenue trends? Where is customer acquisition cost trending? How does LTV compare across segments? Tools: Looker, Tableau, Power BI, Metabase.
You need both. Product analytics tells you what to build. Business intelligence tells you whether what you built is making money. The PM who only looks at product analytics builds features users love but the business cannot sustain. The PM who only looks at BI dashboards optimizes revenue but kills the user experience.
The practical move: spend 70% of your analytics time on product analytics (user behavior, funnels, cohorts) and 30% on business intelligence (revenue, unit economics, market trends). Adjust the ratio based on your company stage — early-stage startups skew more toward product analytics because the business model is still forming.
The AI shift: what changes and what does not
LLMs are changing the data tools landscape. Tools like ChatGPT’s Code Interpreter, Databricks AI, and even natural language query features in Metabase and Looker mean you can describe what you want in plain English and get SQL back.
This does not mean SQL is irrelevant now. It means the opposite. You need to know enough SQL to:
- Verify the output. An LLM can write a query that runs without errors but answers the wrong question. If you cannot read the SQL and spot that it is joining on the wrong key or filtering on the wrong date range, you will make decisions on bad data.
- Debug edge cases. The generated query works on 95% of your data but mishandles null values, timezone conversions, or duplicate rows. If you have zero SQL knowledge, you will not catch this.
- Communicate with precision. When you tell an analyst or an LLM “show me retention,” the quality of the output depends on how precisely you define what retention means for your product. SQL literacy gives you the vocabulary to be precise.
The PMs who will thrive are not the ones who memorize syntax. They are the ones who understand data structures well enough to ask precise questions — whether they type the query themselves or describe it to an AI.
Pick a product you use daily — Swiggy, Zerodha, Spotify, anything. Imagine you are the PM. Write SQL queries (on paper or in a text editor) for these five questions:
- How many new users signed up in the last 7 days?
- What is the daily active user count, grouped by day, for the last 30 days?
- What percentage of users who signed up last month completed the core action (first order, first trade, first playlist)?
- Which city has the highest 30-day retention rate?
- How many users completed the core action more than 3 times in the last week?
You do not need a real database. The exercise is about translating a business question into a structured query. If you can write these five queries correctly, you have enough SQL to be dangerous (in the good way).
Bonus: run your queries through ChatGPT or Claude and ask it to critique them. You will learn more from the critique than from the initial writing.
The tools that actually matter, ranked honestly
After training thousands of PMs, here is what I have observed about which tools actually get used vs. which ones people put on their resume:
Used daily: Excel/Google Sheets (still the workhorse for ad hoc analysis), the company’s product analytics tool, SQL query editor.
Used weekly: BI dashboards (Looker, Tableau, Power BI), session recording tools (Clarity, Hotjar).
Used occasionally: A/B testing platforms (VWO, Optimizely), customer feedback tools (Typeform, SurveyMonkey), cohort analysis tools.
Listed on resume but rarely used: Advanced Python/R, complex statistical tools, custom ML models.
The honest truth: Excel with good data beats Tableau with bad data. A PM who can write a clean SELECT ... GROUP BY ... ORDER BY in their company’s query tool and paste the result into a Google Sheet will outperform a PM who has Tableau certifications but does not understand the underlying data model.
Do not chase tool mastery. Chase data literacy. The tools change every two years. The ability to ask a precise question, get a number, and decide what it means — that skill is permanent.
You are a PM at a food delivery startup in Hyderabad. Your CEO wants to know why order frequency dropped 12% last month. The data team is swamped with a migration project and says they cannot look at it for a week. You have read-only access to the data warehouse.
You need to respond to the CEO by end of day. What do you do?
your path
What to learn next
Once you are comfortable with basic SQL and your company’s analytics stack, the next skills to develop are:
- Metrics That Matter — Knowing which numbers to pull is more important than knowing how to pull them.
- Data Interpretation — Raw numbers mean nothing without context, baselines, and an understanding of statistical traps.
- Diagnosing Metric Drops — The most common analytical task a PM faces, and the one where SQL skills pay off most directly.
- Growth Analytics — Applying data skills to acquisition, activation, and retention loops.
- Experimentation — When you graduate from descriptive analytics (“what happened?”) to causal analytics (“did this change cause that outcome?”).
A PM at Myntra gets a Slack message from the head of analytics: 'Your DAU report has been overcounting users. The query includes users who viewed a product but didn't log in. Our actual logged-in DAU is 18% lower than what's been reported to leadership for 3 months.'
The call: Do you escalate to leadership immediately, or fix the query and wait to see if anyone asks?
A PM at Myntra gets a Slack message from the head of analytics: 'Your DAU report has been overcounting users. The query includes users who viewed a product but didn't log in. Our actual logged-in DAU is 18% lower than what's been reported to leadership for 3 months.'
The call: Do you escalate to leadership immediately, or fix the query and wait to see if anyone asks?