Compensation Data Sources and How to Use Salary Surveys

Salary surveys and compensation databases form the empirical foundation of pay-setting decisions across industries, organization sizes, and geographies. Employers, compensation professionals, and policy researchers rely on these sources to benchmark roles, defend pay structures, and comply with emerging pay transparency laws. The quality, methodology, and scope of any given data source directly determines whether benchmark outputs are defensible or misleading.


Definition and scope

Compensation data sources are structured collections of pay information — base salary, total cash, and total remuneration — gathered from employers, employees, or administrative records and organized by job function, level, industry, geography, and organization size. A salary survey is a specific instrument: a systematic data-collection effort in which participating organizations report pay figures for matched job titles or benchmark positions, which a survey administrator then aggregates, cleans, and publishes as percentile distributions.

The scope of compensation data spans four primary categories:

  1. Published commercial surveys — produced by compensation consulting firms such as Mercer, Willis Towers Watson, and Korn Ferry; access typically requires participation or licensing fees, though their methodologies are publicly documented.
  2. Government-administered surveys — the U.S. Bureau of Labor Statistics (BLS) Occupational Employment and Wage Statistics (OEWS) program collects wage data from roughly 1.1 million employer establishments semi-annually (BLS OEWS Overview), representing the largest free public dataset of occupational pay in the United States.
  3. Association surveys — professional and trade associations — such as the Society for Human Resource Management (SHRM) or WorldatWork — administer surveys within specific sectors, disciplines, or member populations.
  4. Self-reported aggregators — platforms where individuals voluntarily submit compensation data; these carry the highest response-bias risk and are generally not considered primary references for formal compensation benchmarking.

The distinction between base salary vs. total compensation is critical in survey interpretation: a dataset reporting only base salary figures will systematically understate true labor market cost for roles where variable pay and incentive compensation constitutes a material share of pay.


How it works

Survey participation and data use follow a structured process with defined methodological controls.

Participation and submission. Participating employers map their internal job titles to the survey's benchmark job descriptions — a process called job matching. Accurate matching is the single largest driver of data quality. A mismatched submission (e.g., mapping a Director-level role to a Manager benchmark) introduces systematic error that aggregates across respondents.

Data cuts and weighting. Survey administrators segment submitted data by industry code (typically NAICS), revenue band, employee count, and geography. Percentile outputs — the 10th, 25th, 50th, 75th, and 90th percentiles — represent the distribution of pay rates across matched positions. The 50th percentile (median) is the most widely cited market reference point; the 75th percentile is a common target for organizations with aggressive talent acquisition strategies.

Aging and updating factors. Survey data ages from the moment of collection. A survey fielded in Q1 of a given year reflects pay decisions made months earlier. Compensation professionals apply aging factors — expressed as annualized percentage increases — to project historical survey data forward to a current effective date. The BLS Employment Cost Index (ECI) (BLS ECI) is a standard public source for these escalation rates.

Market composite construction. Most organizations blend data from 3 to 5 survey sources for any given job family, weighting each source by sample size, recency, and relevance to the organization's competitive labor market. The output — a blended market composite — feeds into compensation benchmarking and the establishment of job evaluation and pay grades.


Common scenarios

Setting a new hire salary range. When an organization adds a new role, the compensation function pulls relevant survey cuts — typically by industry, geography, and revenue band — to establish a pay range midpoint. The midpoint aligns to a defined market percentile target specified in the organization's compensation philosophy and strategy.

Annual merit cycle market review. Before setting merit budgets, compensation teams compare current pay ranges against updated survey data to determine compa-ratios (an employee's pay divided by the range midpoint, expressed as a percentage). Ranges with compa-ratios below 80 may indicate compression or market lag requiring an off-cycle adjustment. The broader merit pay and performance raises framework depends on range alignment as a prerequisite.

Pay equity analysis. Attorneys and HR professionals use survey data as the external reference point in pay equity and pay gaps audits. Unexplained pay variation relative to market benchmarks — after controlling for job level, geography, and performance — triggers deeper investigation under statutes enforced by the Equal Employment Opportunity Commission (EEOC) (EEOC Pay Discrimination).

Geographic differentiation. Organizations with distributed workforces use geographic pay differential data — often expressed as a location factor relative to a national average — to calibrate pay for remote or field-based employees. This intersects directly with geographic pay differentials and compensation for remote workers.


Decision boundaries

Not all data sources are equivalent for all use cases. The table below contrasts the two dominant reference types:

Dimension BLS OEWS (Government) Commercial Survey
Cost Free, public Participation or licensing required
Sample size ~1.1 million establishments Varies; typically 300–5,000 respondents
Update frequency Semi-annual Annual (most); some quarterly
Job match precision SOC occupation codes Benchmark job descriptions
Use in litigation Widely accepted as independent reference Accepted with methodology disclosure
Geographic granularity Metropolitan Statistical Area (MSA) level Varies by cut availability

The primary decision threshold involves defensibility. In formal contexts — litigation, regulatory review, or board compensation committee reporting for executive compensation — organizations require survey sources with documented, auditable methodologies and statistically meaningful sample sizes. Self-reported aggregator data fails this threshold in virtually all formal proceedings.

A secondary boundary concerns scope alignment. The key dimensions and scopes of compensation determine which data cut is relevant: a survey of all industries in a national geography produces meaningfully different percentiles than an industry-specific cut for the same occupation. Using an inappropriate scope — national generalist data to price a specialist role in a tight regional labor market — produces benchmarks that misrepresent actual hiring conditions.

For a structured overview of how compensation systems are organized across the United States, the compensation authority index provides a categorized reference to all major topic areas covered in this domain.


References

Explore This Site