When a marketer buys a mailing list of “homeowners with household income over $150K who purchased vitamins in the last 90 days,” that data didn’t appear from thin air. Every record on every mailing list traces back to real data sources – some obvious, some surprising.
Understanding where this data comes from helps you evaluate list quality, set realistic expectations, and choose the right data for your campaign.
The Seven Major Data Sources
1. Public Records
Government agencies collect enormous amounts of data as part of their normal operations. Most of this data is public by law, and data compilers aggregate it at scale.
Property records are among the richest public data sources. County assessor and recorder offices maintain records on every property transaction, including:
- Owner name and mailing address
- Purchase price and date
- Assessed value and square footage
- Mortgage amount and lender
- Property type (single family, condo, multi-unit)
This is how list compilers know you’re a homeowner, what your home is worth, and when you bought it.
Vehicle registrations from state DMVs provide data on vehicle make, model, year, and registered owner. This feeds automotive mailing lists and is used to infer income levels.
Voter registration files include name, address, party affiliation, and voting history. Some states include date of birth, phone number, and registration date.
Business filings from secretaries of state provide company name, registered agent, filing date, and business type. This is a foundational source for B2B compiled lists.
Professional licenses from state licensing boards identify doctors, lawyers, real estate agents, contractors, nurses, and dozens of other professions.
2. Self-Reported Surveys and Registrations
Millions of consumers voluntarily share personal information through surveys, product registrations, sweepstakes entries, and warranty cards. This data is then aggregated and made available for marketing.
Survey data captures information that public records can’t:
- Hobbies and interests (gardening, fishing, golf, cooking)
- Health conditions and wellness interests
- Religious affiliation
- Pet ownership (type, breed, number)
- Political opinions and causes
- Planned purchases (new car, home renovation, vacation)
When you fill out a product registration card or enter a sweepstakes, the fine print typically includes consent for the data to be shared with marketing partners. This is one of the most common ways lifestyle and interest data enters the mailing list ecosystem.
3. Transactional and Purchase Data
This is the most valuable category of mailing list data – and the hardest to get. Transactional data comes from actual purchases, subscriptions, donations, and other financial actions.
Mail-order and e-commerce buyers: Companies that sell products through catalogs or online often make their customer lists available for rental through list managers. These lists include what the customer bought, when, how much they spent, and how they ordered (mail, phone, or online).
Magazine subscribers: Publication subscriber files are among the most widely rented lists in the industry. A subscriber to Barron’s signals financial interest. A subscriber to Field & Stream signals outdoor recreation interest. The subscription itself is a behavioral signal.
Donors and contributors: Nonprofits frequently rent their donor files to other nonprofits and to commercial mailers. Donor lists include giving amount, recency, frequency, and cause affiliation.
Credit and financial behavior: Credit bureaus (Experian, Equifax, TransUnion) provide modeled data on income ranges, net worth estimates, and credit activity. They don’t share individual credit scores, but they provide aggregated and modeled financial selects that compilers incorporate into their databases.
4. USPS and Address Data
The United States Postal Service maintains several databases that are essential to the mailing list industry:
National Change of Address (NCOA): When someone submits a change-of-address form with USPS, that data becomes available to licensed NCOA processors. This is how new mover lists are compiled – and how existing lists stay current.
Delivery Sequence File (DSF): Contains every deliverable address in the United States. Used to validate addresses and identify residential vs. business locations.
Carrier Route and ZIP+4 data: Enables geographic targeting down to the individual mail carrier’s route.
5. Phone and Directory Data
Traditional white and yellow page directories, along with telecommunications data, provide phone numbers, addresses, and business classifications. While less comprehensive than they once were (many people have dropped landlines), this data still feeds consumer and business databases.
Reverse-append services can match a phone number to a name and address, or vice versa. This is how “phone append” and “email append” services work – matching one identifier against a larger database to fill in missing contact fields.
6. Digital and Online Behavior
Increasingly, mailing list data incorporates signals from online behavior:
- Website registration data: When you create an account on a website, that data may be shared with data partners (per the privacy policy)
- Cookie and device-level data: Browsing behavior is aggregated and matched to offline identities to create “intent” signals
- Social media profiles: Publicly available social data is used to enrich consumer records with interests, job titles, and affiliations
This category is evolving rapidly as privacy regulations (GDPR, CCPA, state-level laws) reshape what data can be collected and shared.
7. Business-Specific Sources
B2B mailing lists draw from additional sources beyond what’s available for consumer data:
- SEC filings: Public company data including revenue, employee count, and executive names
- Industry directories: Trade associations maintain member directories that feed B2B databases
- Job postings: Companies hiring for specific roles signal growth, technology adoption, and budget availability
- Import/export records: U.S. Customs data reveals international trade activity
- Patent filings: Indicate innovation activity by company
- Government contracts: Federal procurement data shows which companies sell to the government
How Compilers Turn Sources Into Lists
Raw data from these sources doesn’t become a mailing list on its own. Compilers run extensive processing:
- Aggregation: Data from dozens of sources is merged into a single record per individual or business
- Matching and deduplication: Multiple records for the same person are linked using name, address, phone, and email matching algorithms
- Standardization: Addresses are CASS-certified and standardized to USPS format
- NCOA processing: Records are run against the National Change of Address database to update moved addresses
- Deceased suppression: Records are checked against the Social Security Death Master File and other deceased databases
- Enhancement: Records are enriched with modeled data (estimated income, lifestyle clusters, purchase propensity scores)
- Segmentation: The compiled database is sliced into targetable lists by geography, demographics, behavior, and hundreds of other selects
What This Means for List Buyers
Understanding data sources helps you make better decisions:
Compiled lists vs. response lists: A compiled list of “high-income homeowners” is built from public records and modeled data – it tells you who someone is. A response list of “people who bought a $500 vitamin package by mail last month” is built from transactional data – it tells you what someone did. Response lists almost always outperform compiled lists because behavior is a stronger predictor than demographics.
Freshness matters: A list compiled from last month’s property records is more accurate than one compiled a year ago. Always ask when the data was last updated.
No list is perfect: Every data source has gaps and errors. Surveys have self-reporting bias. Public records lag behind reality. Transactional data only covers people who’ve already bought something. The best campaigns use data from multiple sources and test before scaling.
Ask about the source: When evaluating a mailing list, ask your broker or compiler where the data comes from. “Compiled from public records and surveys” tells you something different than “buyers from a nationally known catalog.” The source affects both accuracy and response rates.
The Privacy Side
All of this data collection operates within a legal framework. Key regulations include:
- CAN-SPAM Act: Governs commercial email – requires opt-out mechanisms and honest subject lines
- Telephone Consumer Protection Act (TCPA): Restricts telemarketing calls and texts, requires Do-Not-Call compliance
- Fair Credit Reporting Act (FCRA): Limits how credit-related data can be used for marketing
- State privacy laws (California CCPA/CPRA, Virginia VCDPA, Colorado CPA, etc.): Give consumers rights to know what data is collected and to opt out of its sale
- USPS regulations: Govern how NCOA and address data can be used
Reputable data companies comply with all applicable regulations and provide suppression processing (Do-Not-Mail, Do-Not-Call, deceased, prison) as part of their standard workflow.
Bottom Line
Every mailing list is only as good as its underlying data. The more you understand about where that data comes from – and how it’s processed – the better equipped you are to choose lists that will actually perform for your campaign. Don’t be afraid to ask your broker or compiler about their sources. The good ones will be transparent about it.