Table of contents
Share on Social Media
Nine types of research for thought leadership
Most consulting firms call something "research" when what they've actually done is read a few reports, charted what someone else found, and added a few opinions. That's not research. That's a literature review with decoration.
Thought leadership built on secondary sources gets ignored, because everyone else has access to the same sources. The work doesn't say anything that couldn't have been said by anyone else.
Research that drives thought leadership is different in four ways. It produces original data points that didn't exist before someone did the work. It uses a methodology that holds up when a sharp reader pokes at it. It generates findings designed to be quoted, charted, and dropped into a slide deck. And it answers a question the target audience genuinely cares about, not one the researcher finds interesting.
A bonus criterion separates the best programs from the rest: the research compounds over time. Year one is expensive. Year three is when the data becomes the citation.
What follows is a working menu. Nine forms of research that produce the kind of output worth building thought leadership around, each with a different cost profile, different distribution dynamics, and different long-term IP characteristics.
Here's a quick bird's eye chart:

1. Original surveys
Primary survey research with a defined audience, usually fielded through a panel provider or recruited directly from the firm's network. Questions are designed to produce specific data points, and the data points become the findings. This is the form most firms reach for first when they decide to invest in research.
The hardest part is recruitment, not analysis. A clean panel of 200 mid-level executives at $50 to $90 per complete runs $10,000 to $18,000, and C-suite respondents cost meaningfully more.
What it's good for. Producing the largest volume of original, citable findings per research dollar when designed well. A single survey can generate enough data points to fuel a year of derivative content if the question set is structured around what the audience actually wants to know. Surveys also lend themselves naturally to charts, percentages, and headline statistics, which travel further than narrative findings.
Challenges. The most common failure mode is designing a survey that confirms what the firm already believes. Findings that surprise nobody don't get quoted, because nobody can quote a finding that everyone already agreed with. The discipline is to write questions whose answers you can't predict, which is harder than it sounds.
Examples:
2. Indexes
A single, methodology-locked indicator that gets republished on a fixed schedule. The Consumer Sentiment Index releases every month. The Case-Shiller Home Price Index releases monthly. Once the methodology is set, you don't change it. That's the entire point.
The value compounds. In year one, an index is a research project that costs money to publish. In year five, it's a citation. In year ten, journalists and analysts quote it without checking the source firm, because the firm has become synonymous with the question.
What it's good for. Building research IP that appreciates over time. No other research form produces this kind of long-term ownership of a topic. Once an index has a multi-year track record, it becomes structurally hard for competitors to dislodge, because they would have to publish their own version for years before catching up.
Challenges. Indexes are expensive to start because the methodology has to hold up to decades of scrutiny. The design choices made in year one constrain everything that follows. Picking the wrong indicator or the wrong measurement frame is hard to fix once the index has a public track record, because any methodology change breaks the comparability that's the whole point.
Examples:
3. Benchmarking studies
Performance comparisons across firms, almost always built on data that the audience itself contributes. Respondents fill out the benchmark survey because they want to see where they rank against peers. The benchmark gets more valuable as more firms participate, which makes recruitment easier each year.
The participation loop is the whole reason this form works. You're not asking people to give you data out of generosity. You're trading them a benchmark for it. The exchange is what scales.
What it's good for. Generating distribution alongside the data. Every participating firm shares the report with internal stakeholders to argue for budget or to validate a strategic choice. That's free distribution into the buying committee of every respondent's firm. Benchmarks also produce some of the most quotable single statistics in research, because rankings and percentile cutoffs are inherently citation-friendly.
Challenges. Benchmarks live or die on participation rates and sample quality. A benchmark built on 30 respondents from your friend list is not a benchmark. It's a focus group with a chart. Building a credible benchmark takes meaningful upfront investment in recruitment and methodology, and a few years of running it before the data set is large enough to support the claims you want to make.
Examples:
4. Maturity models
A staged framework that places a firm, team, or process on a development curve, usually with three to five levels. The framework comes with a diagnostic so participants can locate themselves on the curve. CMMI levels run Initial, Managed, Defined, Quantitatively Managed, and Optimizing, and most maturity models follow a similar shape.
The structural feature that makes maturity models unique: the firm that builds the model is almost always the firm that sells what gets buyers from one level to the next. A digital transformation maturity model is built by a digital transformation consultancy. The finding ("you're at Level 2") and the solution ("here's what gets you to Level 4") come from the same source.
What it's good for. Embedding the firm's service offering inside the research finding. This makes maturity models structurally efficient as both research and pipeline tool, because the diagnostic itself produces a qualified buying conversation. A firm that scores low on the diagnostic has been told, in their own data, that they need help.
Challenges. The category is dense with vendor self-serving versions where the model conveniently shows that everyone is at Level 2 and only the publishing firm can move them up. Buyers see through this quickly. Credibility hinges on whether the model produces findings the publishing firm wouldn't have written on its own, and on whether the criteria for each level reflect something more than what the firm happens to sell.
Examples:
5. Longitudinal studies
The same study, fielded the same way, at regular intervals. Year over year comparisons become the central finding. Every annual release earns the right to claim what changed.
Closely related to indexes, but not the same thing. An index is a single locked indicator. A longitudinal study can be a full multi-question survey re-run annually, where the analytical value is in the patterns across questions and across years.
What it's good for. Generating a permanent editorial calendar item. A longitudinal study gives the firm a built-in annual publication moment with news baked in ("year over year, X moved from 47% to 62%"). It also creates an internal data set that gets richer with each release, which means later years' analysis can do things the first year couldn't, including segmentation, cohort tracking, and event studies tied to external shocks.
Challenges. The discipline is in the consistency. The temptation to change questions, swap panels, or adjust methodology between years has to be resisted, because any change breaks the year-over-year comparison that's the entire point. The studies that last decades are run by teams who treat methodological consistency as a non-negotiable, even when it costs them on individual releases.
Examples:
6. Proprietary data analysis
Research conducted on data the firm already owns. Client engagement data, internal operations data, product usage data if the firm has a product, anonymized client outcomes. The data was created in the course of doing the work. The research is in the analysis.
The data acquisition cost is effectively zero, because the data was already collected for other reasons. The work is in deciding what question to ask of it and producing analysis that holds up.
What it's good for. Producing the most defensible research possible. Nobody else can replicate it, because nobody else has the data. Competitors can run the same survey and get similar results. They can't run the same analysis of your internal data. For firms that sit on rich proprietary data sets, this is usually the highest-leverage form of research available.
Challenges. The constraint is generalizability. Whether the firm's internal data represents something broader than itself is the question every smart reader asks first. Stripe can publish on the internet economy because Stripe processes a meaningful share of internet commerce. A boutique firm with 40 clients can publish on those 40 clients but probably can't generalize. The honest framing matters. Readers can tell when a sample is being overclaimed.
Examples:
7. Public data analysis
Structured analysis of data that exists in public form. Government statistics, regulatory filings, scraped websites, conference programs, court records, academic data sets, industry databases. The data is available to everyone, which means the research lives or dies on the analysis.
The cost economics shifted dramatically in the last two years. Tasks that used to require a research assistant for a month, like collecting data from 500 websites or classifying job postings, now run in an afternoon with AI tooling.
What it's good for. Producing strong research at a fraction of the cost of survey-based work. Firms that didn't have research budgets two years ago can now produce serious public data analysis with a single analyst and a Claude API key. This is also the most flexible form, because the data is already there and a new analytical angle is usually one query away.
Challenges. Everyone else has the same tools, which means the bar for what counts as good public data analysis is rising fast. A sloppy analysis is worse than no analysis, because someone will check the work and the firm gets remembered for the error. The opportunity is real, but the analytical rigor required is higher than it was, even though the data is easier to get.
Examples:
8. Document analysis at scale
Systematic analysis of large bodies of text. Earnings call transcripts, 10-Ks, court filings, job descriptions, RFPs, news articles, regulatory comment letters, academic papers. The methodology is intuitive ("we read every X and counted Y") which makes the findings easy to defend.
This was impractical at any meaningful scale before large language models. A human research team could read maybe 200 earnings transcripts a week with reasonable consistency. An AI pipeline can process 20,000 with consistent classification.
What it's good for. Producing findings that feel definitive because the methodology is exhaustive rather than sampled. Counting the number of S&P 500 companies that mentioned AI in their last quarterly call. Tracking the shift in language about remote work across legal filings from 2019 to 2024. Documenting the change in skills requested in finance director job postings over five years. These are claims nobody else has bothered to make at scale, and they tend to surprise because volume reveals patterns no individual reader could catch.
Challenges. The trap is letting the volume of data substitute for the analytical question. Counting words in earnings transcripts is easy. Knowing which words to count, and what conclusion the counts actually support, is much harder. Document analysis that finds something surprising requires a real hypothesis to start. Findings of the form "we processed 50,000 documents and the most common word was 'the'" are technically correct and entirely useless.
Examples:
9. Structured qualitative research
Systematic interview programs across a defined population, not the scattered anecdotal interviews most firms call qualitative research. The discipline is in the structure: a defined sample frame, a consistent interview protocol, transcription and coding of responses, and findings that summarize across interviews rather than cherry-picking the best quote.
Qualitative work captures what surveys miss. Surveys ask the questions the researcher already knows to ask. Interviews surface the questions the researcher didn't know to ask.
What it's good for. Producing the texture and language that quantitative research can't generate. The combination of a survey and 20 to 40 structured interviews almost always produces a sharper report than either form alone. Qualitative work also produces pull quotes at volume, and a sharp executive quote spreads on social platforms better than a chart does, in part because it sounds like a person talking.
Challenges. It's slower and harder to scale than survey work, and most firms either skip it or do it sloppily as a result. Anecdotal interviews that didn't follow a protocol can't support claims about a population, which means the findings have to be framed carefully. The other risk is over-relying on a few articulate respondents whose quotes are good but whose views aren't representative.
Examples:
How to pick one and commit to it
The most common mistake is picking three forms and doing all of them badly. The second most common is picking one form and abandoning it after a year because the first release didn't break the internet.
A practical decision frame comes down to three questions.
First, what's the budget. Original surveys and full longitudinal programs cost real money. Public data analysis and document analysis at scale cost almost nothing once tooling is set up. If the annual research budget is under $20,000, start with public data or document analysis. If it's $50,000 or more, surveys and indexes become viable.
Second, does the audience want benchmarks or narrative. Benchmarking and maturity models work for audiences that want to know where they rank. Qualitative research and indexes work for audiences that want a story or a number to track. The audience determines the form, not the firm's preferences.
Third, do you want a one-time hit or compounding IP. A single survey will produce one good report. An index, a longitudinal study, or a recurring benchmark will produce ten years of compounding citation value. The choice is real, and the compounding forms are almost always the better long-term bet for firms with the patience to wait three years for the payoff.
Whatever the answer, the form matters less than the discipline of running it well for long enough to matter. Pick one. Run it for three years before evaluating whether it worked. Most firms quit at year one because year one looks underwhelming compared to expectations. The firms whose research everyone cites are the firms that didn't quit.
Nine types of research for thought leadership
Most consulting firms call something "research" when what they've actually done is read a few reports, charted what someone else found, and added a few opinions. That's not research. That's a literature review with decoration.
Thought leadership built on secondary sources gets ignored, because everyone else has access to the same sources. The work doesn't say anything that couldn't have been said by anyone else.
Research that drives thought leadership is different in four ways. It produces original data points that didn't exist before someone did the work. It uses a methodology that holds up when a sharp reader pokes at it. It generates findings designed to be quoted, charted, and dropped into a slide deck. And it answers a question the target audience genuinely cares about, not one the researcher finds interesting.
A bonus criterion separates the best programs from the rest: the research compounds over time. Year one is expensive. Year three is when the data becomes the citation.
What follows is a working menu. Nine forms of research that produce the kind of output worth building thought leadership around, each with a different cost profile, different distribution dynamics, and different long-term IP characteristics.
Here's a quick bird's eye chart:

1. Original surveys
Primary survey research with a defined audience, usually fielded through a panel provider or recruited directly from the firm's network. Questions are designed to produce specific data points, and the data points become the findings. This is the form most firms reach for first when they decide to invest in research.
The hardest part is recruitment, not analysis. A clean panel of 200 mid-level executives at $50 to $90 per complete runs $10,000 to $18,000, and C-suite respondents cost meaningfully more.
What it's good for. Producing the largest volume of original, citable findings per research dollar when designed well. A single survey can generate enough data points to fuel a year of derivative content if the question set is structured around what the audience actually wants to know. Surveys also lend themselves naturally to charts, percentages, and headline statistics, which travel further than narrative findings.
Challenges. The most common failure mode is designing a survey that confirms what the firm already believes. Findings that surprise nobody don't get quoted, because nobody can quote a finding that everyone already agreed with. The discipline is to write questions whose answers you can't predict, which is harder than it sounds.
Examples:
2. Indexes
A single, methodology-locked indicator that gets republished on a fixed schedule. The Consumer Sentiment Index releases every month. The Case-Shiller Home Price Index releases monthly. Once the methodology is set, you don't change it. That's the entire point.
The value compounds. In year one, an index is a research project that costs money to publish. In year five, it's a citation. In year ten, journalists and analysts quote it without checking the source firm, because the firm has become synonymous with the question.
What it's good for. Building research IP that appreciates over time. No other research form produces this kind of long-term ownership of a topic. Once an index has a multi-year track record, it becomes structurally hard for competitors to dislodge, because they would have to publish their own version for years before catching up.
Challenges. Indexes are expensive to start because the methodology has to hold up to decades of scrutiny. The design choices made in year one constrain everything that follows. Picking the wrong indicator or the wrong measurement frame is hard to fix once the index has a public track record, because any methodology change breaks the comparability that's the whole point.
Examples:
3. Benchmarking studies
Performance comparisons across firms, almost always built on data that the audience itself contributes. Respondents fill out the benchmark survey because they want to see where they rank against peers. The benchmark gets more valuable as more firms participate, which makes recruitment easier each year.
The participation loop is the whole reason this form works. You're not asking people to give you data out of generosity. You're trading them a benchmark for it. The exchange is what scales.
What it's good for. Generating distribution alongside the data. Every participating firm shares the report with internal stakeholders to argue for budget or to validate a strategic choice. That's free distribution into the buying committee of every respondent's firm. Benchmarks also produce some of the most quotable single statistics in research, because rankings and percentile cutoffs are inherently citation-friendly.
Challenges. Benchmarks live or die on participation rates and sample quality. A benchmark built on 30 respondents from your friend list is not a benchmark. It's a focus group with a chart. Building a credible benchmark takes meaningful upfront investment in recruitment and methodology, and a few years of running it before the data set is large enough to support the claims you want to make.
Examples:
4. Maturity models
A staged framework that places a firm, team, or process on a development curve, usually with three to five levels. The framework comes with a diagnostic so participants can locate themselves on the curve. CMMI levels run Initial, Managed, Defined, Quantitatively Managed, and Optimizing, and most maturity models follow a similar shape.
The structural feature that makes maturity models unique: the firm that builds the model is almost always the firm that sells what gets buyers from one level to the next. A digital transformation maturity model is built by a digital transformation consultancy. The finding ("you're at Level 2") and the solution ("here's what gets you to Level 4") come from the same source.
What it's good for. Embedding the firm's service offering inside the research finding. This makes maturity models structurally efficient as both research and pipeline tool, because the diagnostic itself produces a qualified buying conversation. A firm that scores low on the diagnostic has been told, in their own data, that they need help.
Challenges. The category is dense with vendor self-serving versions where the model conveniently shows that everyone is at Level 2 and only the publishing firm can move them up. Buyers see through this quickly. Credibility hinges on whether the model produces findings the publishing firm wouldn't have written on its own, and on whether the criteria for each level reflect something more than what the firm happens to sell.
Examples:
5. Longitudinal studies
The same study, fielded the same way, at regular intervals. Year over year comparisons become the central finding. Every annual release earns the right to claim what changed.
Closely related to indexes, but not the same thing. An index is a single locked indicator. A longitudinal study can be a full multi-question survey re-run annually, where the analytical value is in the patterns across questions and across years.
What it's good for. Generating a permanent editorial calendar item. A longitudinal study gives the firm a built-in annual publication moment with news baked in ("year over year, X moved from 47% to 62%"). It also creates an internal data set that gets richer with each release, which means later years' analysis can do things the first year couldn't, including segmentation, cohort tracking, and event studies tied to external shocks.
Challenges. The discipline is in the consistency. The temptation to change questions, swap panels, or adjust methodology between years has to be resisted, because any change breaks the year-over-year comparison that's the entire point. The studies that last decades are run by teams who treat methodological consistency as a non-negotiable, even when it costs them on individual releases.
Examples:
6. Proprietary data analysis
Research conducted on data the firm already owns. Client engagement data, internal operations data, product usage data if the firm has a product, anonymized client outcomes. The data was created in the course of doing the work. The research is in the analysis.
The data acquisition cost is effectively zero, because the data was already collected for other reasons. The work is in deciding what question to ask of it and producing analysis that holds up.
What it's good for. Producing the most defensible research possible. Nobody else can replicate it, because nobody else has the data. Competitors can run the same survey and get similar results. They can't run the same analysis of your internal data. For firms that sit on rich proprietary data sets, this is usually the highest-leverage form of research available.
Challenges. The constraint is generalizability. Whether the firm's internal data represents something broader than itself is the question every smart reader asks first. Stripe can publish on the internet economy because Stripe processes a meaningful share of internet commerce. A boutique firm with 40 clients can publish on those 40 clients but probably can't generalize. The honest framing matters. Readers can tell when a sample is being overclaimed.
Examples:
7. Public data analysis
Structured analysis of data that exists in public form. Government statistics, regulatory filings, scraped websites, conference programs, court records, academic data sets, industry databases. The data is available to everyone, which means the research lives or dies on the analysis.
The cost economics shifted dramatically in the last two years. Tasks that used to require a research assistant for a month, like collecting data from 500 websites or classifying job postings, now run in an afternoon with AI tooling.
What it's good for. Producing strong research at a fraction of the cost of survey-based work. Firms that didn't have research budgets two years ago can now produce serious public data analysis with a single analyst and a Claude API key. This is also the most flexible form, because the data is already there and a new analytical angle is usually one query away.
Challenges. Everyone else has the same tools, which means the bar for what counts as good public data analysis is rising fast. A sloppy analysis is worse than no analysis, because someone will check the work and the firm gets remembered for the error. The opportunity is real, but the analytical rigor required is higher than it was, even though the data is easier to get.
Examples:
8. Document analysis at scale
Systematic analysis of large bodies of text. Earnings call transcripts, 10-Ks, court filings, job descriptions, RFPs, news articles, regulatory comment letters, academic papers. The methodology is intuitive ("we read every X and counted Y") which makes the findings easy to defend.
This was impractical at any meaningful scale before large language models. A human research team could read maybe 200 earnings transcripts a week with reasonable consistency. An AI pipeline can process 20,000 with consistent classification.
What it's good for. Producing findings that feel definitive because the methodology is exhaustive rather than sampled. Counting the number of S&P 500 companies that mentioned AI in their last quarterly call. Tracking the shift in language about remote work across legal filings from 2019 to 2024. Documenting the change in skills requested in finance director job postings over five years. These are claims nobody else has bothered to make at scale, and they tend to surprise because volume reveals patterns no individual reader could catch.
Challenges. The trap is letting the volume of data substitute for the analytical question. Counting words in earnings transcripts is easy. Knowing which words to count, and what conclusion the counts actually support, is much harder. Document analysis that finds something surprising requires a real hypothesis to start. Findings of the form "we processed 50,000 documents and the most common word was 'the'" are technically correct and entirely useless.
Examples:
9. Structured qualitative research
Systematic interview programs across a defined population, not the scattered anecdotal interviews most firms call qualitative research. The discipline is in the structure: a defined sample frame, a consistent interview protocol, transcription and coding of responses, and findings that summarize across interviews rather than cherry-picking the best quote.
Qualitative work captures what surveys miss. Surveys ask the questions the researcher already knows to ask. Interviews surface the questions the researcher didn't know to ask.
What it's good for. Producing the texture and language that quantitative research can't generate. The combination of a survey and 20 to 40 structured interviews almost always produces a sharper report than either form alone. Qualitative work also produces pull quotes at volume, and a sharp executive quote spreads on social platforms better than a chart does, in part because it sounds like a person talking.
Challenges. It's slower and harder to scale than survey work, and most firms either skip it or do it sloppily as a result. Anecdotal interviews that didn't follow a protocol can't support claims about a population, which means the findings have to be framed carefully. The other risk is over-relying on a few articulate respondents whose quotes are good but whose views aren't representative.
Examples:
How to pick one and commit to it
The most common mistake is picking three forms and doing all of them badly. The second most common is picking one form and abandoning it after a year because the first release didn't break the internet.
A practical decision frame comes down to three questions.
First, what's the budget. Original surveys and full longitudinal programs cost real money. Public data analysis and document analysis at scale cost almost nothing once tooling is set up. If the annual research budget is under $20,000, start with public data or document analysis. If it's $50,000 or more, surveys and indexes become viable.
Second, does the audience want benchmarks or narrative. Benchmarking and maturity models work for audiences that want to know where they rank. Qualitative research and indexes work for audiences that want a story or a number to track. The audience determines the form, not the firm's preferences.
Third, do you want a one-time hit or compounding IP. A single survey will produce one good report. An index, a longitudinal study, or a recurring benchmark will produce ten years of compounding citation value. The choice is real, and the compounding forms are almost always the better long-term bet for firms with the patience to wait three years for the payoff.
Whatever the answer, the form matters less than the discipline of running it well for long enough to matter. Pick one. Run it for three years before evaluating whether it worked. Most firms quit at year one because year one looks underwhelming compared to expectations. The firms whose research everyone cites are the firms that didn't quit.





