{{for|the non-fiction book|Small Data}} '''Small data''' is data that is 'small' enough for human comprehension.<ref>{{cite news|author=Rufus Pollock |url=https://www.theguardian.com/news/datablog/2013/apr/25/forget-big-data-small-data-revolution |title=Forget big data, small data is the real revolution | News |newspaper=The Guardian |access-date=2016-10-02}}</ref> It is data in a volume and format that makes it accessible, informative and actionable.<ref>{{cite web|url=https://whatis.techtarget.com/definition/small-data |title=What is small data? - Definition from WhatIs.com |website=Whatis.techtarget.com |date=2016-08-18 |access-date=2016-10-02}}</ref>
The term "big data" is about machines and "small data" is about people.<ref>{{cite web |author=Eric Lundquist |url=http://www.eweek.com/enterprise-apps/small-data-analysis-the-next-big-thing-advocates-assert.html/ |archive-url=https://archive.today/20140305170955/http://www.eweek.com/enterprise-apps/small-data-analysis-the-next-big-thing-advocates-assert.html/ |url-status=dead |archive-date=March 5, 2014 |title='Small Data' Analysis the Next Big Thing, Advocates Assert |website=Eweek.com |date=2013-09-10 |access-date=2016-10-02 }}</ref> This is to say that eyewitness observations or five pieces of related data could be small data. Small data is what we used to think of as data. The only way to comprehend Big data is to reduce the data into small, visually-appealing objects representing various aspects of large data sets (such as histogram, charts, and scatter plots). Big Data is all about finding correlations, but Small Data is all about finding the causation, the reason why.<ref name="Why Small Data Is the New Big Data">{{cite web|url=https://knowledge.wharton.upenn.edu/article/small-data-new-big-data/ |title=Why Small Data Is the New Big Data |website=knowledge.wharton.upenn.edu |access-date=2017-05-09}}</ref>
A formal definition of small data has been proposed by Allen Bonde, former vice-president of Innovation at Actuate - now part of OpenText: "Small data connects people with timely, meaningful insights (derived from big data and/or “local” sources), organized and packaged – often visually – to be accessible, understandable, and actionable for everyday tasks."<ref>{{cite web|url=https://smalldatagroup.com/2013/10/18/defining-small-data/ |title=Defining Small Data |date=18 October 2013 |publisher=Small Data Group |access-date=2016-10-02}}</ref> Another definition of small data is: * The small set of specific attributes produced by the Internet of Things. These are typically a small set of sensor data such as temperature, wind speed, vibration and status.<ref>{{cite web|url=https://www.forbes.com/sites/mikekavis/2015/02/25/forget-big-data-small-data-is-driving-the-internet-of-things/#4a72ffad661b |title=Forget Big Data - Small Data Is Driving The Internet Of Things |website=Forbes.com |access-date=2016-10-02}}</ref>
It was estimated (2016) that “If one takes the top 100 biggest innovations of our time, perhaps around 60% to 65% percent are really based on Small Data.”<ref name="Why Small Data Is the New Big Data"/> as Martin Lindstrom puts it. Small data includes everything from Snapchat to simple objects such as the post-it note. Lindstrom believes we become so focused on Big-Data that we tend to forget about more basic concepts and creativity. Lindstrom defines Small Data "as seemingly insignificant observations you identify in consumers’ homes, is everything from how you place your shoes on how you hang your paintings". He thus considers that one should perfectly master the basic (Small Data) in order to mine and find correlations.
==Academic Recognition and Methodology==
The growing significance of "small data" as a distinct field of inquiry was highlighted by the 2024 Thematic Einstein Semester (TES) on Small Data Analysis, hosted by the Berlin Mathematics Research Center MATH+<ref>{{Cite web |title=Berliner Mathematikforschungszentrum MATH+ |url=https://mathplus.de/ |access-date=2026-04-20 |language=en-US}}</ref>. A central focus of this semester was the transition from theoretical analysis to practical decision-making. Because small data sets are primarily used to drive specific actions, the presentation of results becomes an essential methodological step.<ref>{{Cite web |title=2nd Workshop – TES Winter 2023/24 – MATH+ |url=https://mathplus.de/topic-development-lab/tes-winter-2023-24/workshop-2/ |access-date=2026-04-20 |language=en-US}}</ref>
The semester’s findings emphasized that while small data may lack volume, it often contains a high density of "many possible interpretations." Consequently, the final conference of the TES was structured around the pillars of interpretation, explanation, and knowledge gain. Participants sought to develop new mathematical and methodical representations that could accurately depict this wealth of interpretative possibilities. This work underscores that analyzing small data is not purely a computational task; it requires a robust interface between mathematics and diverse disciplines to ensure that insights are both contextually grounded and scientifically rigorous.<ref>{{Cite web |title=Folge 1: Einführung - Thematisches Einsteinsemester Small Data Analysis {{!}} zib.de |url=https://www.zib.de/training-outreach/press/zibcast/episode-1 |access-date=2026-04-20 |website=www.zib.de}}</ref>
==Uses in business==
===Marketing===
Bonde has written about the topic for Forbes,<ref>{{cite web|url=https://www.forbes.com/sites/markfidelman/2012/10/30/these-smart-social-apps-bring-big-data-down-to-size/ |title=These Smart, Social Apps Bring Big Data Down to Size |website=Forbes.com |access-date=2016-10-02}}</ref> Direct Marketing News,<ref>{{cite web|url=http://www.dmnews.com/why-small-data-is-the-next-big-thing-for-marketers/article/308376/ |title=Why Small Data Is the Next Big Thing for Marketers - DMN |website=Dmnews.com |date=2013-08-22 |access-date=2016-10-02}}</ref> CMO.com<ref>{{cite web|last=Bonde |first=Allen |url=http://www.cmo.com/features/articles/2013/11/20/think_small_time_for.html |title=Think Small: Time For Marketers To Move Beyond The Big Data Hype |website=Cmo.com |date=2013-12-12 |access-date=2016-10-02}}</ref> and other publications.
According to Martin Lindstrom, in his book, Small Data: "{In customer research, small data is} Seemingly insignificant behavioural observations containing very specific attributes pointing towards an unmet customer need. Small data is the foundation for breakthrough ideas or completely new ways to turnaround brands."<ref>{{cite web|url=https://www.martinlindstrom.com/small-data/ |title=Small Data - Martin Lindstrom - Bestselling Author |publisher=Martin Lindstrom |access-date=2016-10-02}}</ref> His approach is based on the combination of the observation of small samples with intuition.<ref name= Forbes16>{{cite journal|last1=Dooley|first1=Roger|title=Small Data: The Next Big Thing|journal=Forbes|date=16 February 2016|url=https://www.forbes.com/sites/rogerdooley/2016/02/16/small-data-lindstrom/#115c00837870|access-date=8 May 2017}}</ref> Marketers can obtain market insights from gathering Small Data by engaging with and observing people in their own environments.<ref name="Forbes16" /> In comparison to Big Data, Small Data has the power to trigger emotions and to provide insights into the reasons behind the behaviours of customers.<ref name= MJ>{{cite journal|last1=Sarkar|first1=Christian|title="Small Data, Big Impact!" – An Interview with Martin Lindstrom|journal=The Marketing Journal|date=1 May 2016|url=https://www.marketingjournal.org/small-data-big-impact-an-interview-with-martin-lindstrom/|access-date=8 May 2017}}</ref> It may uncover detailed information on a person's extroversion or introversion, self-confidence, whether one is having problems in his/her relationship, etc.<ref name="MJ" /> According to Lindstrom, relationships among people and customer segments are organized around four criteria: # Climate: It reveals for example how a person's environment affects their diet. # Rulership: The power or government in charge # Religion: The prevalence of religion in a country, depending on its influence, indicates whether a person's decision making process is impacted by their belief system. # Tradition: Cultural norms influence people's behaviors and interactions.
Many companies underestimate the power of Small Data, using samples of millions of consumers instead of recognizing the value of closely observing small samples in their market research.<ref name="Forbes16" /> In his book, Lindstrom defines "7Cs", which companies should consider in the attempt to derive meaningful customer insights and market trends through small data from their customers:<ref name="MJ" /> # Collecting: Understanding the manner in which observations are translated inside a home. # Clues: Uncovering other distinctive emotional reflections that can be observed. # Connecting: Identifying the consequences of emotional behaviour. # Causation: Understanding what emotions are being evoked. # Correlation: Identifying the initial date of appearance of the behaviour or emotion. # Compensation: Identifying the unmet or unfulfilled desire. # Concept: Defining the “big idea” compensation for the identified consumer need.
Some of Lindstrom's clients such as Lowes Foods looked at data in a different way and actually chose to live with the customer. “As you enter their store, they have now created an amazing community where every staff member acts in a character mood, based on Small Data”.<ref name="Why Small Data Is the New Big Data"/> The supermarket made everything it can to make the customer feel at home. All the behaviours of employees are inspired by customer feedbacks gathered from interviews directly done at customer’s home.
===Healthcare===
Researchers at Cornell University started developing applications to monitor health problems in patients, based on small data. This is an initiative of Cornell's Small Data Lab,<ref>{{Cite web | title=Deborah Estrin | url=http://smalldata.io/ | access-date=2024-12-26 | website=smalldata.io}}</ref> in close cooperation with Weill Cornell Medicine College, led by Deborah Estrin.
The Small Data Lab developed a series of apps, focusing not only on gathering data from patients' pain but also tracking habits in areas such as grocery shopping. In the case of patients with rheumatoid arthritis for example, which has flares and remissions that do not follow a particular cycle, the app gathers information passively, thus allowing to forecast when a flare might be coming up based on small changes in behaviour. Other apps developed also include monitoring online grocery shopping, to use this information from every user to adapt their groceries to the recommendations of nutritionists, or monitoring email language to identify patterns that might indicate "fluctuations in cognitive performance, fatigue, side effects of medication or poor sleep, and other conditions and treatments that are typically self-reported and self-medicated".<ref>{{cite web|url=https://research.cornell.edu/news-features/small-data-and-big-health-benefits |title=Small Data and Big Health Benefits |website=research.cornell.edu |date=10 November 2015 |access-date=2017-05-15}}</ref>
=== Postal Service === The United States Postal Service (USPS) used optical character recognition (OCR) to automatically read and process 98% of all hand-addressed mail and 99.5% of machine-printed mail. By combining this technology with its small data sample of US zip codes, the USPS can now process more than 36,000 pieces of mail per hour.<ref>{{Cite web|url=https://about.usps.com/who-we-are/postal-facts/innovative-technologies.htm|title=Innovative Technologies - Postal Facts|website=about.usps.com|access-date=2017-11-07}}</ref>
=== Aerospace === In 2015, Boeing established the analytics lab for aerospace data in cooperation with the Carnegie Mellon University to leverage the university's leadership in machine learning, language technologies and data analytics.<ref>{{Cite news|url=https://www.cmu.edu/news/stories/archives/2015/october/boeing-analytics-lab.html|title=Boeing Establishes Analytics Lab For Aerospace Data at Carnegie Mellon - News - Carnegie Mellon University|last=University|first=Carnegie Mellon|date=October 2015|access-date=2017-11-07|language=en}}</ref> One of the initiatives projects aims to by standardize maintenance logs using AI to dramatically reduce costs.
Currently, there is no standardized procedure to document maintenance logs leading to small but highly unstructured data sets. As a result, it becomes highly difficult for maintenance workers to translate these variations in maintenance logs within a short period of time. However, with AI and a narrow data set of common aircraft maintenance terminology, it becomes possible to dynamically translate these logs in real time. By using AI to enhance the speed and accuracy of the airline maintenance workflow, airlines stand to save billions according to the Harvard Business Review.<ref>{{Cite news|url=https://hbr.org/2017/07/sometimes-small-data-is-enough-to-create-smart-products|title=Sometimes "Small Data" Is Enough to Create Smart Products|work=Harvard Business Review|access-date=2017-11-07}}</ref>
==References== {{Reflist}}
Category:Data management