(Back to the Crossroads Search main page)
A Timely Guide to the American Community Survey: From the US Census Long Form to the ACS (CQ Press)
The Timely Guide site, presented in blog format, is a section of the second edition of the Encyclopedia of the U.S. Census from CQ Press. The Guide was posted in December, in conjunction with the U.S. Census Bureau's release of the first set of American Community Survey (ACS) 5-year estimates of population down to the census tract and block-group level for all geographic areas. The guide includes an introduction, frequently-asked questions, history and methodology of the ACS, and information on interpreting ACS data.
AgeSource/AgeStats Worldwide (AARP International)
AARP International has created a pair of annotated databases, AgeSource and AgeStats, to identify and link to aging-related information online. AgeSource covers "clearinghouses, databases, libraries, directories, bibliographies and reading lists, texts and reports, national statistical resources, training materials, and Web 'metasites' focused on aging or closely allied subjects." AgeStats focuses on statistics comparing the situation of older adults between countries and regions. Users can keyword-search the annotations and also limit the search by type of resource (e.g. statistical resource) and geographic area. The AARP International site also features "country profiles", which consists of a table of quick aging-related stats such as life expectancy, statutory retirement age, and total health care expenditures per capita. .
Aging Statistics (U.S. Administration on Aging)
The "Aging Statistics" section of the U.S. Administration on Aging site links to the following databases and reports:
- Profile of Older Americans
- AGing Integrated Database (AGID)
- Census Data & Population Estimates
- Projected Future Growth of Older Population
- Minority Aging
- Key Indicators of Well-Being
American Mosaic Project (Doug Hartmann and Penny Edgell)
The American Mosaic Project is a multi-year, multi-method study of the bases of solidarity and diversity in American life. The principal investigators of this project are Doug Hartmann, Penny Edgell and Joseph Gerteis at the University of Minnesota, and the project is funded by the Edelstein Family Foundation of Minneapolis, MN.
The survey was designed to gather data on diversity with a particular emphasis on race and religion. A nationally-representative telephone survey was conducted during the summer of 2003; additional in-depth interviews and fieldwork across the country, explored the various contexts in which Americans experience diversity, focusing on religious interfaith organizations, neighborhoods, and festivals. Key findings on Discrimination, Atheism, Anti-Semitism, Racial attitudes of Conservative Christians, Diversity, Racial Attitudes and Religion, and Racial Privilege are available as press releases at this site.
American Time Use Survey (U.S. Bureau of Labor Statistics and U.S. Census Bureau)
In September 2004, the first data from the American Time Use Survey (ATUS) was released by the U.S. Bureau of Labor Statistics. This is the first federal, nationwide study of time use in the United States, with data collection beginning in 2003. The web site carries data tables in ASCII format, with accompanying documentation. A microdata release is planned in the future, and time-series will become available as the survey continues across multiple years.
American Time Use Survey Data Extract Builder (ATUS-X) (Minnesota Population Center)
From the ATUS-X web site: "The American Time Use Survey Data Extract Builder (ATUS-X) is a project dedicated to making it easy for researchers to use data from the American Time Use Survey (ATUS). The ATUS is an ongoing time diary study, started in 2003, that is funded by the U.S. Bureau of Labor Statistics (BLS) and fielded by the U.S. Census Bureau. It provides detailed information about the activities in which respondents engage together with extensive information about the characteristics of those respondents and other members of their households."
ATUS-X assigns uniform codes to the variables across years, and makes the documentation more accessible. With a free registration, users can create extracts from the data, selecting and downloading only the variables and years they need.
Asia Regional Information Center (ARIC) (The Economic Research and Regional Cooperation Department)
Formerly the Asia Recovery Information Center, ARIC was reinvented in 2006 as the Asia Regional Information Center, with a focus on regional cooperation and integration in Asia and the Pacific. Its Economic and Financial Indicators provides statistical data on external, real, and monetary sectors of 23 economies in Asia and South Asia. Indicators include: GDP, inflation, industrial production, and retail sales for the real sector; trade, exchange rates, and balance of payments for the external sector; and money supply, stock prices, and banking indicators for the monetary and financial sector. Data are presented in either growth rate, index, or ratio.
Bowling Alone: Research (Robert D. Putnam)
This web page links to the data that Robert Putnam used in his groundbreaking 2000 classic on social capital "Bowling Alone: The Collapse and Revival of American Community." DDB Lifestyle data (1975-1998) can be downloaded directly from the page, as can 14 state-level measures of social capital and the Comprehensive Social Capital Index. The page also links to data available from the Roper Center (UW-Madison subscribes).
Center for Demography and Ecology (University of Wisconsin-Madison)
Many excellent CDE Working Papers are available online.
Cultural Policy and the Arts National Data Archive (CPANDA) (Princeton University)
CPANDA is "is the world's first interactive digital archive of policy-relevant data on the arts and cultural policy in the United States."
Among the 36 study titles in their catalog (as of May 2010) are the following:
- American Community Survey: Artist Extract File
- Biennial Media Consumption Survey
- Brooklyn Museum Art Controversy Survey 1999
- Classical Music Consumer Segmentation Study 2001
- General Social Survey 1993 - 2002: Culture Module
- Information on Artists
- Museum Program Survey 1979
- National and Local Profiles of Cultural Support 1998-2001
- Performing Arts Research Coalition 2002
- State of the First Amendment
- Survey of Public Participation in the Arts
Data from Developing Countries (BREAD (Bureau for Research in Economic Analysis of Development))
This page is intended to help locate datasets pertaining to data from developing countries that is either in the public domain or that can be obtained at modest cost from the agency collecting the data. This site provides links to some of the data available online and explanations of how to obtain others. Links are grouped into household surveys, firm-level data, macro data sources, and national statistical offices.
Demographic and Health Surveys (DHS) (U.S. Agency for International Development (USAID))
The MEASURE DHS program facilitiates the Demographic and Health Survey, the Service Provision Assessment (SPA) Survey, and the HIV/AIDS Indicator Survey (AIS). The resulting datasets, which focus on Central and South America, Asia, Africa, and the Middle East, are available for direct downloading. Datasets include information on family planning, maternal and child health, child survival, AIDS, educational attainment, and household composition and characteristics. Online data tools include:
- the STATcompiler tool for quick facts and country comparisons, allowing users to build customized tables from DHS surveys and indicators.
- the HIV/AIDS Survey Indicators Database, for creating HIV/AIDS data tables from a variety of sources.
- the STATmapper tool for creating maps from DHS data in more than 75 countries.
Documentation and SPSS/PC and SAS data dictionaries are also available. Users must register and receive a password to download datasets.
Diversitydata.org (Heller School for Social Policy and Management, Brandeis University)
The diversitydata.org project provides metropolitan area indicators of diversity, opportunity, quality of life and health for various racial and ethnic population groups. It describes, profiles and ranks U.S. metro areas in terms of quality of life. These indicators are compiled using data from multiple data sources. They cover a range of social measures such as population, education, health, housing opportunities, economic opportunities, residential integration, neighborhood characteristics, crime and physical environment. Data can be downloaded as Excel format.
Diversitydatakids.org (Heller School for Social Policy and Management, Brandeis University)
Diversitydatakids.org is designed as an information system to monitor the state of wellbeing, diversity, opportunity and equity for U.S. children. Visitors can view and assess important differences across seven geographic levels (depending on data availability): the neighborhood, county, city, school district, metropolitan area, state and national levels. All of their data and analyses are presented by race/ethnicity, and when possible, also by socioeconomic status and immigrant status. It monitors child outcomes as well as key factors (including opportunities, conditions, and resources) that drive child outcomes.
European Data Center for Work and Welfare (EDAC) (University of Tilburg and Danish National Centre for Social Research)
The European Data Center for Work and Welfare (EDAC) is a searchable collection of annotated links to websites containing data for European research and policy-making in the areas of work and welfare. The central topics covered by the site are income and benefits, social care, and work and employment. However, the links also touch on related fields such as demographics, education, taxes, health, migration, politics and elections, and quality of life. EDAC organizes its site, via a left-hand menubar, around the categories of Comparative Data, National Data, and International Repositories. The Comparative Data category is the most detailed, with subheadings for opinion surveys, socio-economic surveys, indicators and statistics, and policies and institutions. Each survey in the Comparative Data category gets a multi-part description on the EDAC site, from survey type to participating countries to topics to availability and searchability of questionnaires and data. The National Data category, by contrast, gives only links and archive names, and the International Repositories category gives a short descriptive paragraph for each link.
European Social Survey (ESS) (European Commission and European Science Foundation)
The European Social Survey (ESS) is a new survey encompassing 19 European nations and focusing on public attitudes and values related to the ongoing change in social institutions. An initial round of data collection was completed in 2001, using a core questionnaire and two rotating modules. The microdata was released in August 2003 via the NESSTAR system at http://ess.nsd.uib.no/nesstarlight/index.jsp. Funding for a second round of data collection has also been obtained, with data release scheduled for 2005.
FlowingData (Nathan Vau)
Since mid-2007 Nathan Vau, a PhD candidate in Statistics at UCLA, has been running the FlowingData blog. His interest is in data visualization, and his blog has attracted other like-minded data enthusiasts, who interact in a fascinating conversation, with lots of thought-provoking images and animations, on how data can be presented.
One useful category of post holds up data graphics from the media for critique by the FlowingData community. Another category presents visualizations created by the blog author himself, with animations on such topics as mapping the expansion of WalMart in the United States over time, and mapping the use of the word "inauguration" on Twitter messages worldwide in the hours surrounding the events of January 20, 2009 in Washington DC. FlowingData also holds contests for its readers to contribute visualizations based on a given dataset, while a forum page adds opportunities for reader input.
Fragile Families & Child Wellbeing Study (Princeton University and Columbia University)
From the web site: "The Fragile Families and Child Wellbeing Study is following a cohort of nearly 5,000 children born in large U.S. cities between 1998 and 2000 (roughly three-quarters of whom were born to unmarried parents). We refer to unmarried parents and their children as 'fragile families' to underscore that they are families and that they are at greater risk of breaking up and living in poverty than more traditional families.
The core FF Study was designed to primarily address four questions of great interest to researchers and policy makers: (1) What are the conditions and capabilities of unmarried parents, especially fathers?; (2) What is the nature of the relationships between unmarried parents?; (3) How do children born into these families fare?; and (4) How do policies and environmental conditions affect families and children?"
Four waves of public-use data are available for download (free registration required.) The site also outlines procedures for applying for restricted ("Contract") data that includes geographic identifiers.
Gateway to Global Aging Data (University of Southern California, the Dornsife Center for Economic and Social Research (CESR), Program on Global Aging, Health & Policy)
Gateway to Global Aging Data is a platform designed for harmonizing cross-national studies of aging to Health and Retirement Study (HRS). It includes Health and Retirement Study (HRS), Mexican Health and Ageing Study (MHAS), English Longitudinal Study on Ageing (ELSA), Survey of Health, Ageing, and Retirement in Europe (SHARE), Korean Longitudinal Study on Aging (KLoSA), Japanese Study on Aging and Retirement (JSTAR), Indonesia Family Life Survey (IFLS), China Health, Aging, and Retirement Longitudinal Study (CHARLS), Irish Longitudinal Study on Ageing (TILDA), Study on Global Ageing and Adult Health (SAGE), and Longitudinal Aging Study in India (LASI). It has a digital library contains survey questions, sets of harmonized variables, and tools to search, compare, and obtain the information from various health and retirement surveys from 25 countries.
General Social Survey (GSS) (National Opinion Research Center (NORC))
"The General Social Survey (GSS) conducts basic scientific research on the structure and development of American society with a data-collection program designed to both monitor social change within the United States and to compare the United States to other nations." The GSS has been conducted regularly since 1972, and many of the core questions are unchanged to allow comparison across years. The GSS site at NORC allows users to search all GSS documents, browse GSS variables, download in SAS or SPSS, or analyze the data online in Nesstar. (The site also links to the SDA online analysis site at Berkeley.)
Generations and Gender Contextual Database (GGP) (United Nations Economic Commission for Europe)
The Generations and Gender Programme (GGP) is a research infrastructure for Social Science. It provides micro- and macro-level data at the national and regional level to improve the knowledge base for social science and policymaking in Europe and developed countries elsewhere. GPP has these key features: cross-national comparability, a longitudinal design, a large sample size, a broad age range, a combination of micro and macro data and a theory-driven and multidisciplinary questionnaire. The GGP surveys and contextual database provide a wide variety of data for policy relevant research by demographers, economists, sociologists, social policy researchers, social psychologists and epidemiologists. Researchers need to register first before they access GGP data.
German Socio-Economic Panel (GSOEP) (Deutsches Institut fuer Wirtschaftsforschung (DIW))
The GSOEP is a longitudinal study of private households across Germany, ongoing since 1984. Some of the many topics include household composition, occupational biographies, employment, earnings, health and satisfaction indicators. The GSOEP web site carries information about the study, including the questionnaires and a web interface to the data (SOEPinfo) that allows for frequency queries and item correspondence. Also available is contact information for obtaining the data directly from GSOEP. [NOTE: DISC has 1984-2002 available on CD-ROM in the library, Study #CA-511-001; more current data releases are available for purchase from GSOEP.]
Global Urban Observatory (United Nations Human Settlement Programme)
The Global Urban Observatory (GUO) program is aimed at developing and applying policy-oriented urban indicators & statistics on an international basis. UN-Habitat Urban Data includes indicators in demography, economy, education, health, streets, slum dwellers, and crime in regions, countries and cities.
Henry A. Murray Research Archive (Institute for Quantitative Social Science (IQSS), Harvard University)
The Henry A. Murray Research Archive holds many longitudinal studies, and that it includes not only quantitative data, but also qualitative materials such as case histories, open-ended interviews, and audio and video tapes.
IISS Armed Conflict Database (ACD) (The International Institute for Strategic Studies)
The International Institute for Strategic Studies was founded in the UK in 1958 with a focus on nuclear deterrence and arms control. It is known for its annual Military Balance assessment of countries' armed forces. Its Armed Conflict Database (ACD) monitors armed conflicts worldwide, focusing on political, military and humanitarian trends in current conflicts, whether they are local rebellions, long-term insurgencies, civil wars or inter-state conflicts. The statistics, data and reports in the ACD date back to 1997. Licensed users can generate reports and download data as well as browse year-by-year analyses and fact sheets.
Indiastat (UW-Madison subscription) (Datanet India)
Indiastat.com provides socio-economic statistical data primarily from Indian government sources in electronic formats (both HTML and Excel files). Data includes statistics on demographics, health, higher education, agricultural and industrial production information, economic statistics, etc. Information from specific sectors such as bank and financial institutions, companies, co-operatives, crime and law, population, foreign trade, labor and workforce, housing, media, power, and transportation are also included. Note: UW-Madison subscribes via IP-authentication. If asked for member sign-in, select "IP Login."
Institute for Research on Poverty (IRP) (University of Wisconsin-Madison, Institute for Research on Poverty)
IRP is a national, university-based center for research into the causes and consequences of poverty and social inequity in the United States. One of the site's highlights is their excellent compilation of Poverty-Related Links. Also among the many offerings is the Child Support Demonstration Evaluation (CSDE), which looks at both direct effects of the child support pass-through policy on child support paid and received, and a wide range of potential secondary effects. To access the public-use data, users must apply and be approved for access to the data. After payment of a fee, users are provided with instructions on how and where to submit SAS programs. CSDE data can only be accessed by submitting SAS computer programs; actual copies of the data files are not provided.
International Studies Compendium Project Datasets (The International Studies Association)
International Studies Compendium Project compiles a list of links to datasets for international studies. They are organized by topics, such as Interstate Conflict Data, Event Data, Interstate Rivalry, Contentious Issues, Intrastate, Civil, and Ethnic Conflict Data, Human Rights and Repression, Terrorism, Conflict Management Data, Geographic Data Contiguity and Distance, Conflict Location Data, Individual Political Leaders, Power and Capability Data, Minorities/Diversity/Heterogeneity, International Law Data, and Prominent Replication Data Sets.
IPUMS Higher Ed (The Minnesota Population Center)
IPUMS Higher Ed offers harmonized versions of the surveys incorporated into the NSF Scientists and Engineers Statistical Database (SESTAT), which is composed of three National Science Foundation surveys: the National Survey of College Graduates, the Survey of Doctorate Recipients, and the National Survey of Recent College Graduates. Its data includes education history, labor force status, employer and academic institution characteristics, income, and work activities. SESTAT data have been used previously to study a wide variety of topics, including gender differences in the labor force and the presence of immigrants in the U.S. science and engineering workforce.
Japanese Study of Aging and Retirement (JSTAR) (Research Institute of Economy, Trade and Industry (RIETI) and Hitotsubashi University and the University of Tokyo)
The Japanese Study of Aging and Retirement (JSTAR), a panel survey of elderly people aged 50 or older in Japan, was launched in 2007 and data from the first wave of the study was released in 2010. JSTAR focuses on the economic, social, and health conditions of elderly people. The study is designed to be comparable to the Health and Retirement Study (HRS) in the United States, Survey of Health, Aging and Retirement in Europe (SHARE) and English Longitudinal Study of Aging (ELSA). The questionnaire, codebook, and a report on the first results can be downloaded from the site. The site also describes conditions, confidentiality levels, and application instructions for qualified researchers and agencies to receive the data free of charge. The JSTAR datasets are available in Stata and CSV formats.
Joint Center for Poverty Research (JCPR) (Northwestern University and University of Chicago)
JCPR supports academic research that examines what it means to be poor and live in America, concentrating on the causes and consequences of poverty in America and the effectiveness of policies aimed at reducing poverty. The JCPR site includes information on publications, conferences, various resources, programs, and research events.
Journalism.org / Project for Excellence in Journalism (Project for Excellence, Pew Resource Center)
The Pew Research Center's Project for Excellence in Journalism has been studying news media, broadly defined, since 1997. Much of the research reported on the Journalism.org web site involves content analysis of press coverage, from traditional media to blogs and other social media sites. The project produces four regular indices: the News Coverage Index, the New Media Index, the Campaign Index, and the Talk Show Index. Also on the site is a comprehensive annual State of the News Media report, with some sortable online tables.
In the "Numbers" section of the site, users can view reports with customizable graphs, or download selected annual datasets from the regular indices listed above.
Kannisto-Thatcher Database on Old Age Mortality (Väinö Kannisto and A. Roger Thatcher)
The Kannisto-Thatcher database on old age mortality (K-T database) includes data on death counts and population counts classified by sex, age, year of birth, and calendar year for more than 30 countries. This database was established for estimating the death rates at the highest ages (above age 80). This database is maintained by the Max Planck Institute for Demographic Research. Researchers need to register first before they access this database.
KIDS COUNT (Annie E. Casey Foundation)
KIDS COUNT is a national and state-by-state effort to track the status of children in the U.S., by the Annie E. Casey Foundation. Its KIDS COUNT Data Center provides high-quality data and trend analysis in nation, state, territory and city levels. KIDS COUNT indicators can be viewed by location, by topic and by characteristics.
Kuwait - Central Statistical Bureau (State of Kuwait Central Statistical Bureau)
Socio-economic and agricultural statistics for the State of Kuwait are published by its Central Statistical Bureau. Statistical reports are downloadable in pdf format. Most reports are in Arabic but some are in English and Arabic.
Living Standards Measurement Study (LSMS) of the World Bank (World Bank)
Household surveys are available for numerous countries, depending on each country's policy concerning release of data for public use. Survey documentation without data may be downloaded for the following countries: Albania, Armenia, Ecuador, Ghana, Guyana, Jamaica, Kazakhstan, Malawi, Morocco, Nepal, Pakistan, Romania, Russia, Viet Nam (Vietnam). The following datasets are available for free download from the web site (as of 10/10):
- Albania 1996, 2002, 2003, 2004, 2005
- Azerbaijan 1995
- Bosnia & Herzogovina 2001, 2002, 2003, 2004
- Brazil 1996-97
- Bulgaria 1995, 1997, 2001, 2003
- China - Heibei and Liaoning 1995 & 1997
- Cote d'Ivoire 1985, 1986, 1987, 1988
- Guatemala 2000
- India - Uttar Pradesh and Bihar 1997-98
- Iraq 2006-07
- Kyrgyz Republic 1993, 1996, 1997, 1998
- Nicaragua 1993, 1998-99, 2001
- Panama Living Standards Survey, 1997, 2003, 2008
- Papua New Guinea Household Survey: 1996
- Peru Living Standards Survey (Encuesta de Hogares Sobre Medicion de Niveles de Vida): 1990, 1991 and 1994.
- Serbia 2002, 2003, 2007
- South Africa Integrated Household Survey (1993-94).
- Tajikistan, 1999, 2003, 2007, 2009
- Tanzania - Kagera, 1991-94, 2004
- Tanzania - National, 1993
- Timor Leste - 2001
- Yugoslavia - Kosovo, 2000
Longitudinal Aging Study Amsterdam (LASA) (Vrije Universiteit (Netherlands))
The Longitudinal Aging Study Amsterdam (LASA) was founded in 1991 to examine predictors and consequences of aging. LASA focuses on physical, emotional, cognitive and social functioning in late life, the connections between these aspects, and the changes that occur in the course of time. One cohort began in 1992, with a second cohort in 2002; observation cycles take place every three years. The sample comes from three regions of the Netherlands, with initial participant ages ranging from 55 to 85. The site provides an extensive bibliography, information on research themes and variables, and links to related sites. A detailed research proposal is required to gain access to LASA data; more information is available on the site.
Making Connections Survey (National Opinion Research Center, University of Chicago)
The Making Connections Survey, part of a larger initiative of the Annie E. Casey Foundation, is a set of longitudinal data collection activities conducted over a ten year period, in ten US cities at two points in time, and in seven of the ten cities at three points in time. The Making Connections Survey is designed to measure how neighborhood change affects the well-being of children and families.
The Making Connections Survey includes questions about the broad topics of neighborhood connections, neighborhood actions, services and amenities, organizations and volunteerism, family hardship, and income and assets. The survey was conducted in Denver, Des Moines, Indianapolis, San Antonio, Seattle (White Center), Hartford, Milwaukee, Oakland, Providence, and Louisville. The baseline survey was fielded in each of the ten Making Connections neighborhoods, and in each county that contained each Making Connections neighborhood. The Wave 2 survey was fielded in the neighborhoods only. Baseline data were gathered between 2002 and 2004. Wave 2 was completed between 2005 and 2007 in the same ten sites. The Wave 3 cycle, scheduled between 2008 and 2011, was conducted in seven of the ten sites.
Data access is available through NORC's data enclave, and requires a data-access agreement, non-exempt IRB approval, and a data enclave application.
Maryland State Data Center (Maryland State Data Center)
Although each state has Data Centers in collaboration with the U.S. Census Bureau, Maryland's online offerings are more extensive than most. The site focuses on census information and socioeconomic projects for the state of Maryland.
MIDMAC - Midlife Research (MacArthur Foundation)
The John D. and Catherine T. MacArthur Foundation Research Network on Successful Midlife Development (MIDMAC) was established in 1989 to explore different aspects of the middle age life-span from a variety of different perspectives, including psychology, sociology, anthropology, and health care. The site contains information about the research studies; actual data is archived with ICPSR (search on MIDMAC or MIDUS).
Million Song Dataset ()
The Million Song Dataset is a freely-available dataset for research on popular music and audio analysis, covering a million contemporary popular music tracks. The data does not include the audio itself, but rather a set of derived features and metadata including such elements as the song artist and title, the number of beats and bars, loudness measures, and pitch and timbre information. Data files are in HD5 format, a format developed by NASA to handle large, heterogeneous, hierarchical datasets. Wrappers are provided for python, MATLAB, and Java. The site also includes SecondHandSongs, a database of cover-songs, and the musiXmatch dataset of lyrics to songs found in the Million Song Dataset.
Multinational Time Use Study (MTUS) (Centre for Time Use Research, University of Oxford)
The Multinational Time Use Study (MTUS) comprises over 50 time-use studies from 19 countries, some as far back as the 1960s. The studies included in MTUS have been recoded to harmonize the variables, allowing analysis that crosses boundaries of countries and years. The MTUS archive contains files from the original surveys as well as the harmonized and aggregate files. While there is no cost to use MTUS data, registration is required for each researcher planning to use the data, and some of the data is subject to additional restrictions.
National Collaborative on Childhood Obesity Research (NCCOR) Catalog of Surveillance Systems (
National Collaborative on Childhood Obesity Research (NCCOR))
The National Collaborative on Childhood Obesity Research (NCCOR) Catalogue of Surveillance Systems has 100 publicly available datasets with information on health behaviors, outcomes, determinants, policies and environmental factors. This free online resource was created for researchers and practitioners to investigate childhood obesity in America.
National Gambling Impact Study Commission (National Gambling Impact Study Commission)
The NGISC was created by Congress in 1996 and charged "to conduct a comprehensive legal and factual study of the social and economic impacts of gambling on federal, state, local, and Native American tribal governments; and on communities and social institutions." This site contains the Commission's 1999 report and a link to the FTP site for NORC's Gambling Impact and Behavior Study dataset.
National Longitudinal Surveys (Bureau of Labor Statistics)
The National Longitudinal Surveys (NLS) are a set of surveys designed to gather information at multiple points in time on the labor market activities and other significant life events of several groups of men and women. The site includes survey overviews, documentation, and links to access data via the NLS Investigator web utility.
- National Longitudinal Survey of Youth 1997 (NLSY97)-- Survey of young men and women born in the years 1980-84; respondents were ages 12-17 when first interviewed in 1997.
- National Longitudinal Survey of Youth 1979 (NLSY79)-- Survey of men and women born in the years 1957-64; respondents were ages 14-22 when first interviewed in 1979.
- NLSY79 Children and Young Adults-- Survey of the biological children of women in the NLSY79.
- National Longitudinal Surveys of Young Women and Mature Women (NLSW)-- The Young Women's survey includes women who were ages 14-24 when first interviewed in 1968. The Mature Women's survey includes women who were ages 30-44 when first interviewed in 1967. These surveys were discontinued in 2003.
- National Longitudinal Surveys of Young Men and Older Men-- The Young Men's survey, which was discontinued in 1981, includes men who were ages 14-24 when first interviewed in 1966. The Older Men's survey, which was discontinued in 1990, includes men who were ages 45-59 when first interviewed in 1966.
National Marriage Project (University of Virginia)
The National Marriage Project, formerly located at Rutgers, is now at the University of Virginia. It is a national clearinghouse for research and information on marriage. Reports on the status of marriage are issued regularly by the project. An annual compendium of statistics and other information on marriage called The State of Our Unions is available from this web site. Periodically their researchers also conduct surveys and fieldworks with a focus on younger non-college men and women, and review and critique marriage and family-life education programs in the schools. It is a useful source to locate citations to datasets used in research on marriage and relationships.
NCCP Data Tools (National Center for Children in Poverty, Columbia University)
The NCCP Data Tools page contains five elements:
- Basic Needs Budget Calculator - calculates cost of minimum basic necessities for a given location and family composition
- Family Resource Simulator -- shows the impact of various federal and state "work support" benefits on the budgets of low- to moderate-income families.
- 50-State Demographics Wizard -- creates custom HTML tables (downloadable in CSV) of national- and state-level statistics about low-income or poor children, using Current Population Survey ASEC data
- 50-State Policy Wizard -- creates custom tables with information about state and federal policies that assist low-income families and children.
- Income Converter -- to convert values between annual income, percent of federal poverty guidelines, and percent of state median income for a given family size, state and year.
NEMDA - Key Data on Ethnic Minorities in Great Britain (National Ethnic Minority Data Archive)
Sponsored by The National Ethnic Minority Data Archive project (NEMDA), this site contains data about ethnic minorities in Great Britain, primarily from the 1991 census. There is no raw data, but the site provides summary charts in HTML and PDF reports. Note: last updated in 1999.
New Beneficiary Data System (U.S. Social Security Administration)
The Social Security Administration's NBDS contains extensive information on the changing circumstances of aged and disabled beneficiaries. Based initially on a national cross-sectional survey of new beneficiaries in 1982, the original database has been expanded with information from administrative records and a second round of interviews in 1991. Variables measured in the original New Beneficiary Survey (NBS) include demographic characteristics; employment, marital and childbearing histories; household composition; health; income and assets; program knowledge; and information about the spouses of married respondents. Public use data and documentation may be downloaded from the site.
NLS Investigator (Center for Human Resource Research (CHRR), Ohio State University)
The NLS Investigator is a web-based interface to documentation and data from all the cohorts of the National Longitudinal Study (NLS). Like its predecessor Web-Investigator, NLS Investigator allows users to search the database by variable name, question text, survey year and question number. Users can view the codebook information associated with variables, select and extract variables, and create a codebook unique to the variables chosen. Investigator provides value labels in the statistical results files. A weighting program option lets users create a custom set of survey weights, making it easier to accurately calculate summary statistics from multiple years of data. Registered users can perform variable extractions without downloading any software or full data files, and can update and save their tag sets on the server for up to 90 days. Result files can be saved to a local computer or left in a personal NLS Investigator account for up to 4 days. Note: the old Web-Investigator version will be disabled after October 29, 2010. Users who are using the new NLS Investigator for the first time will have to complete a one-time free re-registration.
OECD Factbook (Organisation for Economic Co-operation and Development (OECD))
OECD Factbook is a comprehensive and dynamic statistical publication. Close to 100 indicators cover a wide range of areas: economy, education, energy, transportation, environment, development, health, industry, information and communications, population, employment and labour, trade and investment, taxation, public expenditure and R&D. Starting in OECD Factbook 2015-2016 new indicators are included on a number of regional indicators including GDP by metropolitan area. The Factbook is available through the UW-Madison OECD iLibrary subscription.
Paises@ (Countries@) (Instituto Brasileiro de Geografia e Estatística (IBGE))
The Instituto Brasileiro de Geografia e Estatistica (Brazilian Institute of Geography and Statistics) has announced a new map-based data extractor called Paises@ ("Paises" translates to "Countries"). This Flash-based ready reference tool contains basic population, social indicator, economic, technological, land use, and United Nations Millenium Development Goals information for each of the 192 United Nations members for the latest year available.
To use the extractor, first select a language (Portuguese, English, Spanish). Then select a country from the World Map on the Paises@ main page. Basic information about the country will appear. Select a category of indicators for that country, then click the globe icon next to an individual indicator to get a sortable listing for that indicator for all countries in the database. Additional information for each country includes a political map, a short slide show featuring photographic highlights of the country, and a link to a Google satellite map.
Note: for geographically-small countries, users may have to use the magnifying glass icon (+) several times, in conjunction with the arrow keys, until the country becomes large enough to click on.
Panel Study of Income Dynamics (PSID) (National Science Foundation and National Institute on Aging and others)
The PSID is a longitudinal survey of a representative sample of US individuals and their families, ongoing since 1968. The data were collected each year through 1997, and every other year starting in 1999. Topics include income and wealth, expenses, education, and health care. A section on philanthropic giving and volunteering was added in 2001. A subsetting utility is available on the site, or data files may be downloaded in their entirety.
Population Europe Resource Finder and Archive (PERFAR) (Max Planck Society for the Advancement of Science)
The Population Europe Resource Finder and Archive (PERFAR) links policies, data, and research for users to explore population developments. It is designed to meet the information needs of researchers, policy makers, civil society organizations and journalists. It includes detailed information on policies over time, a link catalog to socio-economic and demographic data, and an online repository of research outcomes. It covers countries outside of Europe as well.
Poverty Guidelines, Research and Measurement (Assistant Secretary of Planning and Evaluation, U.S. Dept. of Health & Human Services
and Human Services))
This site includes the annual report on HHS Poverty Guidelines, as well as information contacts and references on the Poverty Guidelines, the Poverty Thresholds, and the Development and History of the U.S. Poverty Lines. The site also includes several PDF format articles on poverty measurement.
Project on Human Development in Chicago Neighborhoods (PHDCN) (Inter-university Consortium for Political and Social Research (ICPSR) and Harvard School of Public Health; John D. and Catherine T. MacArthur Foundation)
According to the web site at ICPSR, "The Project on Human Development in Chicago Neighborhoods (PHDCN) is a large-scale, interdisciplinary study of how families, schools, and neighborhoods affect child and adolescent development. It was designed to advance the understanding of the developmental pathways of both positive and negative human social behaviors." Three types of quantitative data are available as of May 2010:
- Community Surveys, measuring the structural conditions and organization of neighborhoods in Chicago in 1994-95.
- Systematic Social Observation data, collected in 1995 by videotaping sampled city blocks and coding characteristics.
- A Longitudinal Cohort Study collected three waves of data (1994-1997, 1997-1999, and 2000-2001) from a sample of children, adolescents, young adults, and their primary caregivers.
- An Infant Assessment unit of the Longitudinal Cohort Study.
Russia Longitudinal Monitoring Survey-Higher School of Economics (RLMS-HSE) (University of North Carolina at Chapel Hill)
The RMLS-HSE looks at the social and economic effects of the Russian Republic's economic reforms in the 1990s. Household data has been collected 18 times since 1992; the survey was designed to capture overall economic and health well-being. While some of the data at one time required payment, currently a partnership with the Higher School of Economics in Moscow has resulted in the data being made available free of charge. All potential users of the data must first register and then complete the requirements outlined in the Data-Use Agreement, including IRB approval and an explicit confidentiality plan.
SEDLAC - Socioeconomic Database for Latin American Countries (Center for Distributional, Labor and Social Studies (CEDLAS) of the University of La Plata, Argentina and Latin America and the Caribbean Poverty and Gender Group, World Bank)
The SEDLAC database brings together data from household surveys in Latin America and the Caribbean, focusing on poverty and other social variables. The database incorporates over 200 surveys from 25 countries, presenting annual figures at the national and subnational level in Excel spreadsheets, text briefs, and maps. Users can browse the site by thematic category, or create their own tables using pull-down menus. A "Statistics by Gender" area of the site, updated in December 2009, breaks out the data by gender and also presents gender inequality variables.
Small Area Income and Poverty Estimates (SAIPE) (U.S. Bureau of the Census)
From the SAIPE site: "The U.S. Census Bureau, with support from other Federal agencies, created the Small Area Income and Poverty Estimates (SAIPE) program to provide more current estimates of selected income and poverty statistics than those from the most recent decennial census. Estimates are created for states, counties, and school districts. The main objective of this program is to provide updated estimates of income and poverty statistics for the administration of federal programs and the allocation of federal funds to local jurisdictions. Beginning with the estimates for 2005, data from the American Community Survey are used in the estimation procedure; all prior year estimates used data from the Annual Social and Economic Supplements of the Current Population Survey."
Files are released annually. State and county files cover the years 1989, 1993, and 1995 onward. School district files cover 1995, 1997, and 1999 onward. Files are downloadable in .xls and .txt format. Users can also use drop-down menus to display data in HTML tables.
Social Indicators of Development (SID) (World Bank and CIESIN)
SID, hosted by CIESIN, contains the World Bank's most detailed data collection for assessing human welfare to provide a picture of the social effects of economic development. Data are presented for over 170 economies, omitting only those for which data are inadequate. Data are through 1993.
Social Research Update (University of Surrey, United Kingdom)
This general reference periodical for beginning social science researchers is issued quarterly by the Department of Sociology, University of Surrey, Guildford, England. Previous issues have included such topics as Ethnographic writing, Archiving qualitative research data, and Secondary analysis of qualitative data.
Social Science Japan Data Archive (SSJDA) (Center for Social Research and Data Archives, Institute of Social Science, University of Tokyo)
The Social Science Japan Data Archive (SSJDA) is affiliated with the Center for Social Research and Data Archives in the Institute of Social Science at the University of Tokyo. SSJDA collects, maintains, and provides access to social science data to researchers who are interested in Japanese quantitative data for secondary analyses. Users are required to fill out online applications and get approval before they can access datasets housed in SSJDA. Most of the datasets are in Japanese.
Sociometrics Corporation (Sociometrics Corp)
The Sociometrics Corporation is a commercial entity that provides data & documentation in six major areas: sexuality, health & adolescence; family; social research on aging; drug abuse; AIDS/STD; and disability. Studies can be searched or browsed; The datasets may be ordered for a fee, or may be available through other DISC subscriptions.
SocioSite (University of Amsterdam)
SocioSite covers a startlingly broad array of sociological resources on the Internet, and is being diligently maintained. The focus is from the Dutch viewpoint but the universe is world-wide. A detailed list of subject areas serves as a jumping-off point, including a section on social science data archives.
State Health Facts (Kaiser Family Foundation)
The State Health Facts site "is designed to provide free, up-to-date, and easy-to-use health data on all 50 states." Coverage falls into twelve subject areas:
- Demographics and the economy
- Health costs and budgets
- Health coverage and uninsured
- Health insurance and managed care
- Health Reform
- Health status
- Medicaid and CHIP
- Minority health
- Providers and service use
- Women's health
Statistical Accounts of Scotland (EDINA)
In celebration of its 5th anniversary, the EDINA National Datacentre has made available the Statistical Accounts of Scotland dataset. These data, collected by parish ministers in the 1790s and the 1830s, provide a rich record of topics such as: wealth, class and poverty; climate, agriculture, fishing and wildlife; population, schools, and the moral health of the people. A free login is required.
Survey of Income and Program Participation (SIPP) (Bureau of Labor Statistics)
SIPP is a nationwide longitudinal survey for the U.S. that collects information on topics such as poverty, income, employment, and health insurance coverage. SIPP core content covers demographic characteristics, work experience, earnings, program participation, transfer income, and asset income; additional topical modules are also included with each wave. The site contains data and documentation back to 1984.
Survey Research Data Archive-Academic Sinica (Taiwan) (Academic Sinica)
The Survey Research Data Archive stores and disseminates various surveys related to Taiwan for mainly academic purposes. Studies are included on a variety of issues relating to sociological trends in Taiwan. SRDA uses Nesstar for online data dissemination and analysis; of the SRDA datasets available via Nesstar, only one (the Taiwan Education Panel Study) is available in English. Most of the rest of the site is Chinese-only as well.
Swiss Foundation for Research in Social Sciences (FORS) (University of Lausanne, Switzerland)
The Swiss Foundation for Research in Social Sciences (FORS) is an organization of the Swiss government hosted at the University of Lausanne, providing services, conducting research, and publishing and disseminating research findings in the social sciences. Surveys that are currently under the aegis of FORS include the Swiss Household Panel, the Swiss Electoral Studies Selects and the Social Report. The former Swiss Information Service and Data Archive for the Social Sciences (SIDOS) is now under FORS management as well. FORSbase is a platform to integrated data storage for primary data and metadata with a catalog and a data repository.
The Cline Center for Democracy (The Richard G. and Carole J. Cline Center for Democracy, University of Illinois at Urbana-Champaign)
The Cline Center for Democracy has a number of public datasets produced by the Societal Infrastructures and Development (SID) Project. SID offers education attainment data starting in 1950 and covering 166 countries and rule of law data with two composite measures to gauge cross-national and inter temporal differences in law-based orders for 165 nations from 1850 to 2010. Meanwhile, the Social, Political and Economic Event Database (SPEED) extracts event data from a global archive of news reports covering the Post WWII era. SPEED offers civil unrest event data and coup d'état event data in the post-World War II era.
The International Data & Economic Analysis (IDEA) at USAID (U.S. Agency for International Development (USAID))
The International Data & Economic Analysis (IDEA) is USAID's source of economic and social data and analysis. The IDEA has over 12,000 data series from over 125 sources. The data are broken down by countries, years and the following sectors: Economy, Country Ratings and Rankings, Trade, Development Assistance, Education, Health, Population, and Natural Resources. Visitors can query the International Data & Economic Analysis (IDEA) by countries, series and periods. Results can be downloaded in csv format.
The International Database on Longevity (IDL) (Max Planck Institute for Demographic Research)
The International Database on Longevity (IDL) contains thoroughly validated information on individuals of extreme ages. Data collection was performed in such a way that age-ascertainment bias is avoided. The IDL allows for the demographic analysis of mortality at the highest ages. Originally, the data were collected on individuals who attained an age of 110 years or more known as supercentenarians. In the meantime the data collection has been extended to include younger ages for some countries. Researchers can obtain IDL data after they register online.
The Quantitative Study of Dreams (University of California, Santa Cruz)
This site provides tools, methodology, and a databank for a quantitative content analysis method of dream research. Also included are links to study reports.
TransMONEE: Database of Socio-Economic Indicators for CEE/CIS Countries (United Nations Children's Fund and Innocenti Research Center)
TransMONEE contains over 150 economic and social indicators divided into ten different topics (population, natality, child and maternal mortality, life expectancy and adult mortality, family formation, health, education, child protection, crime indicators, and economic indicators) for 27 transition countries in Central Europe and the former Soviet Union. Annual time series data is available as far back as 1989. The TransMONEE data and country profiles are both available in Excel format.
Trends in UK Statistics Since 1900 (House of Commons Library)
This PDF report, illustrated with graphs and tables, summarizes selected social and economic trends in the UK since 1900, including: population, health, education, defense, crime, transport, energy, and leisure, among others. The document is part of a research report collection presented by the UK House of Commons Library. Various reports are available for 1998-2000.
U.N. Social Indicators (United Nations Statistics Division)
Most-recently-collected year's worth of basic figures for many countries, in HTML tables, in these categories: population, child-bearing, youth and elderly populations, education, human settlements, literacy, water supply and sanitation, income and economic activity, housing, and unemployment.
U.S. Demography (CIESIN)
Included here are informative explanations of the following datasets: Public Use Microdata Samples, Current Population Survey, Economic Census Data, County Business Patterns, County City Data Book, Statistical Abstract Supplement, National Economic Social and Environmental Databank, Regional Economic Information System, Enhanced County to County Migration 1985-1990, TIGER 1992 Boundaries, and STF3A Standard Extracts.
UN Classifications Registry (United Nations Statistics Division)
The Classifications registry keeps updated information on Statistical Classifications maintained by the United Nations Statistics Division (UNSD). Downloadable classifications include International Standard Industrial Classification of All Economic Activities(ISIC), Central Product Classification (CPC), Standard International Trade Classification(SITC), Classification by Broad Economic Categories (BEC), Classification of the Functions of Government (COFOG), Classification of Individual Consumption According to Purpose (COICOP), Classification of the Purposes of Non-Profit Institutions Serving Households (COPNI), Classification of the Outlays of Producers According to Purpose (COPP) and International Classification of Activities for Time-Use Statistics (ICATUS). Rulings, corrections, interpretations and proposals for future revisions are recorded and can be viewed from the Registry entries link at this site. National Classifications section includes information on national practices in the area of classifications, covering activity and product classifications used in countries around the world.
United Nations Development Programme (UNDP) (United Nations Development Programme)
This site has information on focus areas, the UN system, news, publications, and statistics on the Human Development Report and the Development Cooperation Analysis System.
Uruguay, Banco de Datos (Universidad de la Republica)
The Data Bank of the Sociodemographic and Political Information Unit (UISDP) is a source of qualitative and quantitative information about Urugayan research in Political Science, Sociology, Social Work, Population, and International Relations areas. Sources for datasets are the faculty at the Universidad de la Republica and in Uruguayan private centers of research. The web page provides access to more than a hundred data banks at Latin American and world-wide statistical institutions. Most of the web site is in Spanish.
Virginia Emigrants to Liberia Database (Marie Tyler-McGraw and Deborah Lee)
From 1820 to 1865, the state of Virginia sent more emigrants to the newly-created independent African republic of Liberia than any other U.S. state. A 2007 book by historian Marie Tyler-McGraw, recounting Virginia's participation in the Liberian colonization movement, led to a collaboration in which two related databases were placed online under the auspices of the Virginia Center for Digital History: a database of 3,700 emigrants and one of 250 emancipators. Stories, timelines and other resources are included on the site as well.
The Emigrants Database includes variables for first and last names, gender, age at emigration, place of origin, emancipator, ship name, date of emigration, level of education, occupation, Liberian destination, and additional notes. The Emancipators Database includes first and last names, locale, occupation, year of emancipation, ship name, emancipated names, and notes. Both databases are searchable by selected fields. If all selections are set to "any," all records are included in the resulting table. There does not appear to be a download option.
Windows on Urban Poverty (Bruton Center and University of Texas at Dallas)
The Windows on Urban Poverty site takes Census data from 1970 through 2000 to map the geographic dimension of poverty in the United States. The user can select a city or metropolitan area, view the location of high-poverty census tracts, and observe the changes in high-poverty areas over time. The maps can also show race, ethnicity, population density, housing value and housing age. The user interface is a simplified GIS system; for a more complex mapping tool covering the same data, come to DISC and use the Census Neighborhood Change CD-ROM. Note: the Windows on Urban Poverty site was down 1/24/07 due to hackers.
World Database of Happiness (Ruut Veenhoven, Erasmus University, Rotterdam)
According to the site description, "the World Database of Happiness is an ongoing register of scientific research on the subjective enjoyment of life. It brings together findings that are scattered throughout many studies and provides a basis for synthetic work." The site includes: a bibliography of articles published using happiness data; an "item bank" of responses to national surveys in the past 50 years; information on distributional and correlational findings on happiness; and codebook information for two datasets: States of Nations and Trends in Nations. Contact the site author for data in SPSS format.
World Tables (World Bank)
World Tables, an older publication of the World Bank, provides a detailed collection of economic data and socioeconomic indicators. Standardized data is presented for numerous individual countries, useful for international comparisons. A Supplement containing revised data was published approximately six months after the main volume. Data are presented as annual times series covering multiple years (1972-1992, 1994, 1995), and are presented as Topical Pages or as Country Pages.
World Values Survey (World Values Study Group)
The World Values Survey (WVS) is a worldwide investigation of sociocultural and political change. It is conducted by a network of social scientists at leading universities all around the world. The survey is performed on nationally representative samples in almost 100 countries which contain almost 90 percent of the world's population, using a common questionnaire. WVS has been carried out since 1981. Data files are available for download in comma delimited, SPSS SAV, R, and STATA formats (free registration required). The site also includes an online data analysis module, allowing users to select countries and samples for browsing data. Its cartographical view provides a different view of results and allows users to get indicators useful to build rankings. Maps are displayed for indicators built for each variable.
(Back to the Crossroads Search main page)