Princeton University Library Data and Statistical 

Search DSS

Finding Data Analyzing Data Citing data

About Us

DSS lab consultation schedule
Sep 1-Nov 3By appt. here
Nov 6-Dec 15Walk-in, 2-5 pm*
Dec 18-Feb 2By appt. here
Feb 5-May 4Walk-in, 1-5 pm*
May 7-May 15Walk-in, 2-5 pm*
May 16-Aug 31By appt. here
For quick questions email
*No appts. necessary during walk-in hrs.
Note: the DSS lab is open as long as Firestone is open, no appointments necessary to use the lab computers for your own analysis.

Follow DssData on Twitter
See DSS on Facebook

Finding Data: Data on Housing


  • Selected Resources for:

    Housing - Non USA ::

  • airbnb data New York City Metro and San Francisco Metro (Airdna) (October 2014-October 2017) / Havana Cuba Report 2015
    Airbnb data for New York City and San Francisco metropolitan areas and a report for Airbnb on Cuba.

  • American Community Survey (ACS) (1996+)
    Nationwide survey designed to provide communities with a fresh look at how they are changing. 2005 is the first full year. Prior years were small test runs. Replaces the decennial long form. Tells us what the population looks like and how it lives. Includes occupancy status, homeownership data, and housing cost data. IPUMS also contains the data in an easier to use format. More recent data may be found directly on the Census website. For summary statistics, use Social Explorer.

  • American Housing Survey (1973+)
    Comprises 2 types of data collections: a national survey of housing units, and surveys of housing units in selected metropolitan areas. The interviews cover core questions that are repeated each year, and an additional set of questions on recurring or one-time supplemental topics. The national data were collected annually through 1981 and have been collected every 2 years since that time. The metropolitan-area data are collected on a continuous basis and are reported annually. For the latest see the Census Bureau site.

  • Assets and Opportunity Scorecard
    Source for data on household financial security and policy solutions. Looks at financial assets and income; businesses and jobs; housing and homeownership; healthcare; and education.

  • Building Permits ASCII Files. Place Level Annual 1980-2010 and County Annual: 1990-2011.
    Building permit statistics on new privately-owned residential construction.

  • Building Permits Survey
    Provides national, state, and local statistics on new privately-owned residential construction.

  • City of Milwaukee Realtor Violations (2005-2016)
    The violations file contains all city code violations written from 2005-2015 in Milwaukee, Wisconsin. The permits file is a listing of all permits pulled by property owners and the ownership file denotes the name and address of the person/entity who is on file as the owner of every parcel in the city.

  • Data-Planet Statistical Datasets
    Provides easy access to a wide variety of economic, social, political, and marketing indicators.

  • Fair Market Rents (1983+)
    Primarily used to determine payment standard amounts for the Housing Choice Voucher program, to determine initial renewal rents for some expiring project-based Section 8 contracts, to determine initial rents for housing assistance payment (HAP) contracts in the Moderate Rehabilitation Single Room Occupancy program (Mod Rehab), and to serve as a rent ceiling in the HOME rental assistance program. The U.S. Department of Housing and Urban Development (HUD) annually estimates FMRs for 530 metropolitan areas and 2,045 nonmetropolitan county FMR areas.

  • Fannie Mae National Housing Survey Monthly Key Indicators (a subset of the complete monthly Fannie Mae National Housing Survey)
    Track public attitudes toward owning and renting a home, purchase and rental prices, home ownership distress, household finances and overall confidence in the economy. Covers January 2010+. Includes a cumulative file.

  • Government Sponsored Enterprise Data. MSA, National, State (1993-2006)
    The Department of Housing and Urban Development publishes information on the mortgage purchases of Fannie Mae and Freddie Mac, two Government Sponsored Enterprises (GSEs) that HUD oversees. The GSEs are secondary-market institutions that purchase single-family conventional loans originated in the United States. Fannie Mae and Freddie Mac are required to meet specified goals for purchases of mortgages that finance housing for very-low-, low- and moderate-income families and families living in areas traditionally underserved by the mortgage market. The information is intended to aid mortgage lenders, planners, researchers, and housing advocates in studying the flow of mortgage credit and capital in America's communities. Data for multiple family homes can be found on the HUD website.

  • Home mortgage disclosure - HMDA/aggregation master data (1981-1989)

  • Home Mortgage Disclosure Act (HMDA): Loan Application Register (LAR) and Transmittal Sheet (TS) Raw Data (2007-2008, 2011-2013)

  • Home Mortgage Disclosure Act Raw Data (1997-2001, 2003-2005)
    HMDA Raw data for 1990-1996, 2002, and 2007-2013 are stored on the DSS webpage as is the Panel Public Data for 2000-2002. For data after 2006, also see the HMDA page. Also see Home Mortgage Disclosure Data Files, compiled 1981 - 2009.

  • Housing Affordability Data System (HADS) (1985+)
    Set of files derived from the 1985 and later national American Housing Survey (AHS) and the 2002 and later Metro AHS. Categorizes housing units by affordability and households by income, with respect to the Adjusted Median Income, Fair Market Rent (FMR), and poverty income. It also includes housing cost burden for owner and renter households. These files have been the basis for the worst case needs tables since 2001. ICPSR has 2002 and 2004 only. The site also has a quick lookup feature.

  • Housing Vacancy and Ownership Survey
    Official source of housing vacancy and home ownership data. Current information on the rental and homeowner vacancy rates, and characteristics of units available for occupancy. Rental and homeowner vacancy rates and homeownership rates are available for the U.S., regions, states, and for the 75 largest Metropolitan Areas. State and Metropolitan Area data are available annually, while national and regional data are available quarterly. Homeownership rates are also tabulated by age of householder and by family status for the U.S. and regions. In addition, estimates of the total housing inventory and percent distributions of vacant for-rent and for-sale-only units are available for the U.S. and regions. Does not include housing costs.

  • Low-Income Housing Tax Credit Database (LIHTC)
    The Low-Income Housing Tax Credit (LIHTC) is the most important resource for creating affordable housing in the United States today. The LIHTC database, created by HUD contains information on projects between 1987 and 2013. National source of information on the size, unit mix, and location of individual projects. Includes project address, number of units and low-income units, number of bedrooms, year the credit was allocated, year the project was placed in service, whether the project was new construction or rehab, type of credit provided, and other sources of project financing. Geocoded.

  • Monthly Interest Rate Survey (MSA Level - 1973-2011)
    Data from the Federal Housing Finance Agency at the metropolitan statistical area.

  • Mortgage & Asset Securities Issuance (1990-March 2011)

  • Moving to Opportunity (MTO) for Fair Housing Demonstration: Interim Impacts Evaluation, Tier 1 Restricted Access Data, 1994-2001 [United States]
    Designed to answer questions about what happens when very poor families have the chance to move out of subsidized housing in the poorest neighborhoods of 5 very large American cities. MTO was a demonstration program: its approach combined tenant-based housing vouchers with location restrictions and housing counseling. MTO was also a randomized social experiment, carefully designed and rigorously implemented to test the effects of this approach on participating families. The interim evaluation included the collection of data on a wide range of outcomes that could potentially be affected by the MTO intervention. These outcomes fit into 6 study domains: (1) mobility, housing, and neighborhood, (2) adult and child physical and mental health, (3) child educational achievement, (4) youth delinquency and risky behavior, (5) adult and youth employment and earnings, and (6) household income and public assistance receipt. The restricted access data being made available through ICPSR includes many such analytic variables constructed from surveys and administrative data. The Tier 1 data also includes a census tract ID that allows researchers to link other neighborhood-level data.

  • Neighborhood Effects on the Long-Term Well-Being of Low-Income Adults From All Five Sites of the Moving to Opportunity Experiment, 2008-2010
    Nearly 9 million Americans live in extreme-poverty neighborhoods, places that also tend to be racially segregated and dangerous. Yet, the effects on the well-being of residents of moving out of such communities into less distressed areas remain uncertain. Moving to Opportunity (MTO) is a randomized housing experiment administered by the United States Department of Housing and Urban Development that gave low-income families living in high-poverty areas in 5 cities the chance to move to lower-poverty areas. Families were randomly assigned to one of three groups: 1. The low-poverty voucher (LPV) group (also called the experimental group) received Section 8 rental assistance certificates or vouchers that they could use only in census tracts with 1990 poverty rates below 10 percent. The families received mobility counseling and help in leasing a new unit. One year after relocating, families could use their voucher to move again if they wished, without any special constraints on location. 2. The traditional voucher (TRV) group (also called the Section 8 group) received regular Section 8 certificates or vouchers that they could use anywhere; these families received no special mobility counseling. 3. The control group received no certificates or vouchers through MTO, but continued to be eligible for project-based housing assistance and whatever other social programs and services to which they would otherwise be entitled. Families were tracked from baseline (1994-1998) through the long-term evaluation survey fielding period (2008-2010) with the purpose of determining the effects of "neighborhood" on participating families.

  • New York City Housing and Vacancy Survey, 1991+
    Conducted approximately every 3 years to comply with New York state and New York City's rent regulation laws. The Census Bureau has conducted the survey for the city since 1965. The rental vacancy rate is the primary focus of the survey, because that value is crucial to the current rent control and rent stabilization laws. Other important survey data include rent regulation status, number of stories, number of units in building, number of rooms in unit, type of heating fuel, monthly rent, estimated value and building condition. Also includes information concerning housing and neighborhood quality. Although the main purpose of the survey is to collect housing data, information on the demographic status of the population and households of the city is also collected. Information collected includes age, sex, race, ethnicity, household composition, labor force status, income, employment, and education level. 1991, 1993, and 1996 are available at the DSS site. For 1999, 2002, 2005, 2008, 2011, and 2014 see the Census Bureau.

    Sample Size: Approximately 18,000 housing units representing the 5 boroughs of the city.

  • New York City. Department of Homeless Services. Statistics and Reports.

  • Panel Study of Income Dynamics (PSID) Series. 1968+
    Designed to fill the need for a better understanding of the determinants of family income and its changes. Longitudinal survey of US individuals and the families in which they reside. Can be used for cross-sectional, longitudinal, and intergenerational analysis and for studying both individuals and families. For comparisons of Current Population Survey, Survey of Income and Program Participation, and PSID, see the comparison chart. For comparisons of National Longitudinal Surveys, Survey of Income and Program Participation, and PSID, see the comparison chart. Includes section on education, food, health, housing, psychological state, and religion. Geography is down to the state level. Also see Poverty Threshold data, 1968-2005.
    Includes various supplemental surveys including:
    • Disability and Use of Time (DUST): collected information from older adults in PSID families about disability, time use, and well-being during telephone interviews conducted shortly after the 2009 and 2013 Main Interviews.
    • Child Development Supplement (CDS): In 1997 information on PSID children ages 0-12 was obtained through extensive interviews with the child, their primary caregiver, secondary caregiver, absent parent, teacher, and school administrator. Information was collected again in 2002/2003 and 2007/2008 for children in this cohort who remained under 18. Information includes a broad array of developmental outcomes including physical health, emotional well-being, intellectual and academic achievement, cognitive ability, social relationships with family and peers, time diaries, and much more. The CDS 2014 covers all sample children in PSID households born or adopted into the PSID since 1997. The study design and questionnaire content are consistent with earlier waves of CDS to permit cross - cohort analysis of children's development.
    • Housing, Mortgage Stress, and Wealth Data (2009, 2011, 2013, 2015) as well as Wealth files for 1984, 1989, 1994, 1999, 2001, 2003, 2005 and 2007.
    • Transition into Adulthood: When children in the CDS cohort are older than 18 and have left high school, information is obtained about their circumstances. This study has been implemented in 2005, and biennially thereafter. Information includes measures of time use, psychological functioning, marriage, family, responsibilities, employment and income, education and career goals, health, social environment, religiosity, and outlook on life.
    • Childhood Retrospective Circumstances Study (PSID - CRCS). First study conducted by the PSID using the internet as the primary mode of data collection. Goal was to design and collect a mixed mode (web or paper) module from household heads and, if married/cohabitating, spouses/partners, about their childhood experiences. The data may be used to study early life influences on adult health and economic outcomes. Has questions on parental relationships, health, socioeconomic status, neighborhood quality, friendships, school experiences, exposure to the criminal justice system, parent/guardian mental health, and young adut mentoring. The initial PSID-CRCS sample consisted of 13,117 individuals aged 19 and older (aged 19 by January 1, 2013) who were household heads and spouses/partners in PSID families that participated in the 2013 wave of PSID. Individuals for which other family unit members or proxies served as respondents in the 2013 core PSID interview and those who completed their core interview in Spanish were not eligible (N=593). During editing eligibility status was reviewed and confirmed for 12,985 cases. Many portions of CRCS are restricted.

  • Property Owners & Managers Survey (POMS)
    Was designed to learn more about rental housing and the providers of rental housing. The purpose was to gain a better understanding of the property owners and managers on whom the nation depends to provide affordable rental housing and what motivates their rental and maintenance policies. Interviewing for the survey was done between November 1995 and June 1996. A nationwide sample of approximately 16,300 housing units which were rented or vacant-for-rent in the 1993 American Housing Survey National Sample (AHS-N) was selected, and a questionnaire was mailed to the property owner, manager, or other agent of the owner of each property containing a selected unit. Detailed information was collected on maintenance, management practices, tenant policy, financial aspects of rental property ownership, owner characteristics, and related topics.

  • RealtyTrac Activity Reports and Inventory Reports (2005-March 2017)
    The Foreclosure Activity Report provides tracking of past and current foreclosure market trends, as well as forecasting future trends. The data provided in the report includes the total number of Notices of Default (NOD), Lis Pendens (LIS), Notices of Trustees' Sale (NTS), Notices of Foreclosure Sale (NFS) and Real Estate Owned (REO) for the entire period requested on either a monthly, quarterly or annual basis. The Inventory Report provides a listing of every property in RealtyTrac's foreclosure database at a selected moment in time. It provides investors, lenders and loan servicers with total counts of all properties RealtyTrac shows as currently in some stage of the foreclosure process as of the time the report is requested and processed. Similar to the Activity Report, the information provided in the Inventory Report is broken down by NOD, LIS, NTS, NFS, and REO. Unlike the Activity Report, however, the Inventory Report shows all inventories as of the instant the report is made, as opposed to taking a historic look back at previous activity. Princeton holdings for activity reports at the MSA level are for 2005-2009 and at the zip code level for 2007-March 2017. Princeton holdings for inventory reports are at the MSA level for 2007-March 2017.

  • Rent Stabilization Ordinance (RSO) Study Data
    Survey data from property owners and managers as well as renters in Los Angeles.

  • Residential Finance Survey (RFS) (1991)
    Provides data on the characteristics of the financing of homeowner and rental properties, including characteristics of the mortgages, properties, and property owners. There are 2 files: a homeowner property file and a rental and vacant property file. There are 2 record types on each file: a property record (information provided by the property owner), and a mortgage record (information provided by the mortgage lender). The potential universe is about 66,000,000 properties securing about 38,000,000 mortgages, plus all mortgage lenders in the United States. About 70,000 properties were in this sample. Geography: United States, regions, inside/outside metropolitan areas, central cities, not in central cities; and the states of California, Florida, New York and Texas.

  • Residential Finance Survey (RFS) (2001)
    Only survey designed to collect and produce data about the financing of nonfarm, privately-owned residential properties. Collects, tabulates, and presents data for properties, the standard unit of reference for financial transactions related to housing. In most other demographic surveys, the unit of reference is the person, household, or housing unit. Only source of information on property, mortgage, and financial characteristics for multi-unit rental properties. Information on multi-family loans and properties is particularly difficult to obtain, but is important to understand if progress is to be made in the development of standards for underwriting multi-family mortgages. Conducts interviews of property owners and mortgage lenders, resulting in more accurate information on property and mortgage characteristics. Only survey which is able to provide a comprehensive view of mortgage finance in the USA, by providing information not only about the loan itself from the lender, but also information about the property owner's demographic characteristics. As part of the decennial census, it is mandatory. This is important in collecting information from mortgage lenders. Exempt from statutes prohibiting release of financial records by financial institutions. Able to subdivide the industry into relevant components. Different parts of the industry have excellent information on their own loans and clients, but not that of the industry as a whole. Information on lending by individual investors or small groups of investors such as pension funds is collected only by the RFS.

  • Survey of Construction
    Provides current national and regional statistics on starts, completions, and characteristics of new, privately-owned single-family and multifamily housing units and on sales of new single-family houses.

  • Survey of Income and Program Participation. (SIPP) 1984+
    Longitudinal U. S. government survey of the financial status of American households conducted since 1983 (data starts with 1984). Covers government transfer and service programs, pension coverage, housing affordability, home ownership data, housing cost data (primarily mortgages), financial assistance for education, among other topics. Data may also be accessed via ICPSR. The National Bureau of Economic Research (NBER) makes the SIPP data and documentation available along with Stata, SPSS, and SAS programs for reading the data. Users may find the NBER site the most convenient source, but the most recent data may not be available there. A new sample (panel) is introduced at the beginning of each calendar year, and the duration of each panel ranges from 2 1/2 years to 4 years. Considered the most comprehensive survey for measuring household wealth by race and ethnicity. Panels also contain topical modules. Data is at the state level. For comparisons of the Current Population Survey, SIPP, and the Panel Study of Income Dynamics, see the comparison chart. See the Household Economic Studies (P70) series for summary reports. See the Census Bureau for an overview.

    Sample Size: Ranges from approx. 14,000 to 37,000 households. Interviews are conducted with those 15 and over.

  • Survey of Market Absorption of Apartments (SOMA)
    Uses the Census Bureau's Survey of Construction (SOC) as its sampling base. Each month, a sample of residential buildings containing 5 or more units is selected. The initial 3-month interview collects information on amenities, rent or sales price levels, number of units, type of building, and the number of units taken off the market (absorbed). Field representatives conduct subsequent interviews, if necessary, at 6, 9, and 12 months after completion. Beginning in 2002, the survey started collecting information on "Senior Housing". Quarterly reports, including basic tabulations, are published and released on the Internet. These quarterly reports are generally available the first week of March, June, September, and December. Additionally, data on absorption rates for selected metropolitan areas are also released on the Internet at the same time. Pre-publication data, detailing information for regular rental units and condominiums is published approximately one month earlier. There are also 2 annual publications released on the internet, the H-130 Annual which provides 12-month absorption data for the previous year and is released the first week April and the H-131 Characteristics of Apartments Report which provides annual 3-month absorption rates - released the first week of July.

  • U. S. Census - IPUMS. 1850+
    Public use samples of individual-level data from the U. S. census.

  • Weekly Applications Survey
    15 indices covering application activity for fixed rate, adjustable rate, conventional and government loans for home purchases and refinances

This page last updated: October 21, 2009