
May
Will Synthetic Data Take Over Market Research?
jerry9789 0 comments artificial intelligence, Burning Questions
Is Synthetic Data Replacing Consumer Research?
Lately interest in synthetic data has been gaining steam, judging from the conversations, posts and discussions around it. Easier access to advanced modeling tools, improved efficiency and effectiveness, as well as the opportunity for better privacy governance are seen as the driving forces for the surge in its popularity. Some are even marketing synthetic research not just as a solution but as a replacement for traditional, slower and often expensive research methodologies, presenting it as the faster, cost-effective and modern approach to consumer research. But is synthetic data indeed the future of market research?
Image: Darlene Alderson
What Is Synthetic Data?
Put simply, synthetic data is information that wasn’t directly collected from real world consumers or respondents. Instead, it’s artificially generated data produced by mathematical models or algorithms designed to mimic natural or real-world data.
It can be “fully synthetic,” meaning it was primarily created by algorithms with little direct connection to real respondents, or “partially synthetic,” where gaps in real data are filled in by AI. “Augmented” data, perhaps the more popular form of synthetic data, is simulated information built or extrapolated from a foundation of real-world information.
Aside from the benefits of speed, efficiency, and cost-effectiveness, synthetic data is helpful with various aspects of experimentation, such as preliminary testing, checking hypotheses, stress testing, iteration, and data fusion, even before any data is collected. It could help improve cases where sample data is small or limited because of difficulties acquiring real data or niche populations. And with rising expectations when it comes to governing privacy, synthetic data is being perceived as a solution to easily share and analyze sensitive information with lesser risks of identifying respondents.
Image: Sherin Sam
What’s Keeping Researchers From Embracing Synthetic Data?
While researchers acknowledge the benefits offered by synthetic data and are interested enough to explore the new realms it unlocks, there’s no general feeling of rushing to embrace the new hot tech. Rather, the push for adopting synthetic data seems to come more from research agencies and their marketing arms, rather than the researchers themselves or even their corporate clients.
So why aren’t more and more researchers jumping on the prospect of using synthetic data for their studies? Proponents of synthetic data extol its 80% match rate with real data; however, researchers recognize that that 20% divergence might make or break the research, as it could be where you’ll find the more nuanced opinions, emotion-driven responses, and meaningful differences.
There’s also the stigma associating the term “synthetic” with “fake.” There is distinction, however, between synthetic data and fake data, as the former is generated rather than invented like the latter. Synthetic data draws from real data so it reflects outputs that can be validated, tested, and compared; fake data isn’t afforded the same respect and measure of accountability.
Understandably, there are concerns about the quality of the data the AI models are fed on. Poor quality data can lead to oversimplification, overexaggeration, and bias reinforcement. Perhaps most importantly, researchers are concerned with losing the human element in synthetic data, that disconnect from genuine behavior which is revealed when observing how people naturally- and often spontaneously- express themselves. Human truths that are deeply tied to cultural, economic, and psychological factors, grounding insights in real-world behavior while elevating them from mere statistical guesswork.
In addition to AI hallucinations, synthetic data left to iterate by itself eventually produces nonsensical results. AI models have also been observed to be too eager to please, potentially discounting the opportunity for contrarian responses, unexpected perspectives and uncovering pain points which real participants often provide, potentially leading to groundbreaking insights and discoveries.
Image: Michelangelo Buonarroti
The Future Of Market Research with Synthetic Data
Synthetic data might be far from the game-changer vendors are hyping it up to be but researchers appreciate having it as another tool at their disposal. Synthetic research could work if you need to confirm or validate ideas quickly and while working on a budget, but it shouldn’t be expected to produce breakthroughs or unravel deeper levels of understanding the same way natural data does. It can help improve studies by filling in gaps but these would require validation as well as being transparent to stakeholders regarding the nature of the data behind the results.
Rather than being a direct replacement, synthetic research could serve study objectives and goals better by complementing, supporting and/or augmenting consumer research. Synthetic data alone would give everybody the same information, but adding human input and oversight could mean the difference in uncovering resonant insights with a level of confidence that truly drives or influences decisions and actions.
Additionally, synthetic data is also not a one-and-done solution. Human behavior and attitudes aren’t fixed and they change over time, so why should synthetic data remain the same and stagnate? To foster credibility and uphold confidence, synthetic data would require consistent updating and stringent control, as well as be verifiable and reflective of the real world.
Yes, synthetic data can be powerful, but by itself would falter without that all-important additional layer of humanity. Market research was, after all, founded on listening to real people, so synthetic data must be anchored in human truths to produce meaningful and relevant insights. AI-driven market research might be lauded by some as the way of the future, but it won’t spark anywhere near the same level of confidence that synthetic data empowered by human truth inspires.
Additional Reading:
Can Synthetic Respondents Take Over Surveys?
Trade Talk: Synthetic data: Intriguing, but is anyone actually sold?
Why We Don’t Talk About ‘Synthetic Data’—And Why You Shouldn’t Either
Synthetic data can benefit medical research — but risks must be recognized
Synthetic Data in Market Research: An Expert View on Why Natural Data First Still Wins
Featured and Top Images: cottonbro studio

Dec
Can AI Replace Human Respondents In Qualitative Research?
jerry9789 0 comments artificial intelligence, Brand Surveys and Testing, Brandview World, Burning Questions
Like most industries these days, market research is no stranger to AI with its broad applications including the employment of synthetic respondents, which are individual profiles constructed by Large Language Models (LLMs) from real or simulated data. They offer fast, cheap, and scalable synthetic data that closely mimics how human participants would respond, a boon for quantitative research. But can synthetic respondents be just as effective in qualitative research? Can AI-powered profiles fully take over the role of human respondents in market research?
Image: Diana
Synthetic Respondents and Qualitative Research
L&E Research recently hosted a webinar sharing their findings and observations testing synthetic respondents across a variety of qualitative research tasks. They shared that AI characteristically produces quick, structured, and consistent surface-level insights. It does well with detecting macro trends in usage or preferences, concept screening if you need to compare multiple ideas at scale, and spot issues with survey testing. It is also capable of gap-filling or simulating missing segments from known data, as well as bulk analysis for summarizing large open-ends quickly.
The key takeaway L&E found is that AI can describe what people do, but it falls short of telling why people do it. AI fundamentally excels in following patterns, but it would struggle with finding out the emotional driver, the motivation behind certain responses. AI can match logic but it won’t be able to fill in tone, nuance nor context like human insight and experience can.
Most AI models are also built on public data and may not have access to knowing how real people would respond to certain questions. When the engineers tried to influence AI agents in the direction of how real participants would respond, it rejected this notion and firmly stood by the perspective formed from the vastness of public data.
Additionally, AI can be absolutely and confidently wrong. Synthetic data can look convincingly human but since AI relies on patterns instead of experience, the air of confidence it puts up doesn’t guarantee accuracy.
Of course, the hosts added a disclaimer that this is where synthetic respondents are at right now, as no one could tell how things could possibly be so much different in the years to come. But the continued utilization of AI in market research- or any other industry, for that matter- is inevitable thanks to the operational and executionary efficiency it grants, and that is enough reason to continue studying and developing synthetic respondents.
Image: Ron Lach
Why The Human Factor Matters
In market research, emotions matter and context counts. AI can prove to be a powerful partner but it is no replacement for lived insight or validation. Human researchers are simply going to remain essential.
AI’s inherent structure and consistency is representative of its pursuit of perfection; however, humans aren’t perfect, nor simple. Humans are emotional and oftentimes, irrational. AI participants would respond based on their perfect approximation of how a human being would, but the synthetic logic behind that would be narrower and more consistent, as it discounts the fact that humans are imperfect.
Humans also bring incredible complexity and a broader range of perception to the table. We can contradict ourselves, and this would be natural. One human participant’s perception and experiences could inform the difference in how they respond from the next, while synthetic data would be uniformly shaped by congruence and invariability, no matter how much effort or work is put into making AI come close to mimicking humanlike responses.
The complexity, variability, and randomness of human nature is desirable in qualitative research. The engineers recognized this and cautioned about overly guiding or influencing randomness in AI that it “will hard-code your picture of randomness to the point where it is no longer random.”
AI can quickly give you bulk analysis but you might not want to rush in bringing it to your stakeholders, as they would question and challenge the quality and reliability of synthetic data. Human insight continues to be vital and irreplaceable when it comes to trust, nuance, and real-world complexity in market research.
Image: Kathrine Birch
The Hybrid Approach
At the end of it all, the hosts made a point that the webinar wasn’t meant to scare people away from synthetic data but rather bring a valid conversation on when it makes sense to take advantage or steer clear of AI-generated personas. In fact, they recommended utilizing a hybrid approach of employing virtual respondents and recruiting human participants, striking a delicate balance between synthesis and empathy.
Synthetic data would be great during the early exploratory stages of market research when you want to get an initial pulse check, something quick and good enough before getting people involved. But once you’re at the point when you need to uncover the emotional driver behind responses and decisions, understand or predict behaviors, or even gain a bit more confidence and trust in your findings, that’s when you bring in your human respondents.
This all aligns not only with a recent growing trend of companies coming around from the AI hype of the last few years but also with our stance on the appropriate use of AI, where we advocate for the responsible and ethical use of artificial intelligence. Instead of handing AI complete reins over all aspects of a business- or in this case, all stages of research work- we at Cascade Strategies encourage the thoughtful and practical application of artificial intelligence in combination with or enhanced by human experience, values and discretion.
To find out how our brand of inspired and enlightened human thinking can help you with your market research needs, please contact us here.
Additional Reading:
Can Synthetic Respondents Take Over Surveys?
Featured Image: Darlene Anderson
Top Image: Michelangelo Buonarroti

Aug
Can Synthetic Respondents Take Over Surveys?
jerry9789 0 comments artificial intelligence, Burning Questions

What Are Synthetic Respondents?
AI has increased operational efficiency by streamlining knowledge bases and shortcutting processes so it’s no surprise people and companies are looking for more ways for its application. For market research, one curious consideration is whether it could take over surveys, essentially by replacing actual respondents with synthetic respondents.
Also known as virtual respondents, digital personas, and Virtual Audiences, synthetic respondents are individual profiles constructed by Large Language Models (LLMs) from real or simulated data. Ideally, the data or descriptions used to generate these profiles come from previously conducted surveys and are combined with individual-level demographics, attitudes and behaviors.
Using these synthetic respondents over real respondents could benefit your research with speed, accuracy and cost savings, at least according to their advocates. Basically, you just need to conduct one survey and from the profile description or data you gathered from the actual respondents, you’re able to generate results from the constructed individuals over and over for succeeding studies and research.
Testing Synthetic Respondents
While synthetic respondents could accurately represent real respondents, relying exclusively on the results from these AI-based individuals may not be entirely beneficial. A webinar hosted by Radius Global took a closer look at the potential of AI-generated synthetic respondents through three case studies of quantitative concept testing, quantitative communications research, and qualitative communications research.
Aggregate results for the concept tests involving game controllers indicate somewhat strong similarities between the results of the real and synthetic respondents. This extends to the results from the quantitative communications research when it comes to the believability of statements on the benefits of milk, although there were some differences. The differences were much more pronounced though when it comes to surprise over the same statements, and there was incongruence when considering how each statement could possibly increase milk consumption.
The qualitative communications research was seeking in-depth insights into women’s needs, perceptions, and preferences for running a race or marathon, with the feedback gathered meant to be used for creative content. Personas were constructed from the profiles of six women aged between 18 and 64 years old who ran at least once in an average week. They had an LLM assume each persona to allow a comparison between findings from real participants to synthetic respondents.
They found that while both real and synthetic respondents have somewhat similar responses when it comes to functional aspects as goals for women in general pursuing fitness, the AI responses lacked emotional expressions. There are also little differences in the synthetic respondents’ responses despite having different profiles, and there was even a lack of subtle differences.
As for concerns among women who are aspirational marathon runners, the synthetic personas were consistent in their responses while the real respondents provided more nuances, variety, and perspectives more prevalent among women.
Synthetic Respondents vs. Real Respondents
Synthetic respondents appear to be useful if you’re evaluating existing ideas and concepts; however, if you’re looking for “breakthroughs” or essentially new insights you would’ve never arrived at had you not performed the case study or research, you would need to engage with real respondents, relying exclusively on their results or combining them with that of synthetic respondents. Yes, there could be cases where synthetic respondents could be used, but the results must be extensively validated. It would also require increasing the efficiency of how data used to construct these individuals are analyzed in addition to enhancing the quality of the data and information gathered for these profiles through thorough screening, intelligent probing, and smart choice models.
There is a place for synthetic respondents in market research, but as another tool in a researcher’s toolbox. They won’t be taking over surveys or replacing actual respondents wholesale anytime soon, it seems, as that elusive “Eureka” moment researchers seek is inherently tied to the nuances and perspectives of human emotion and experience you simply can’t construct.














