Global Synthetic Dataset
- URL
- https://www.ctdatacollaborative.org/page/global-synthetic-dataset
- Description
-
Largest publicly available individual-level data on human trafficking. The dataset is made possible by innovative technology to protect the safety and privacy of victims and survivors. Provides critical information on the socio-demographic profile of victims, types of exploitation, and the trafficking process, including means of control used on victims. This data, updated in 2024, represents 20 years of assistance and hotline data – with substantial contributions from IOM and Polaris, as well as contributions from A21, RecollectiV, and the Portuguese Observatory on Trafficking in Human Beings (OTSH).
This dataset represents over 206,000 victims and survivors of trafficking identified across 190 countries and territories from 2002 to 2022. Note that some attributes (e.g., countries) are suppressed as they are highly sensitive and cannot be protected, although it does not mean that no trafficking cases were recorded.
This is the third synthetic dataset derived from victim of trafficking case records. It accurately preserves the statistical properties of the original victim case records while providing the guarantee of differential privacy. Differential privacy was first developed at Microsoft Research in 2006, and today represents the gold standard in privacy protection. The differential privacy approach to synthetic data generation provides quantifiable privacy guarantees against any privacy attacks, even across multiple data releases. The technology has enabled CTDC to share more data and conduct more robust research while protecting privacy and civil liberties.
- Sample
- Format
- Data archive or collection
- Country
- Afghanistan, Albania, Algeria, Andorra, Angola, Antigua and Barbuda, Argentina, Armenia, Australia, Austria, Azerbaijan, Bahamas, Bahrain, Bangladesh, Barbados, Belarus, Belgium, Belize, Benin, Bhutan, Bolivia, Bosnia & Herzegovina, Botswana, Brazil, Brunei, Bulgaria, Burkina Faso, Burundi, Cambodia, Cameroon, Canada, Cape Verde, Central African Republic, Chad, Chile, China, Colombia, Comoros, Costa Rica, Cote d'Ivorie, Croatia, Cuba, Cyprus, Czech Republic, Democratic Republic of the Congo (formerly Zaire), Denmark, Djibouti, Dominica, Dominican Republic, Ecuador, Egypt, El Salvador, Equatorial Guinea, Eritrea, Estonia, Eswatini (Swaziland), Ethiopia, Fiji, Finland, France, Gabon, Gambia, Georgia, Germany, Ghana, Greece, Grenada, Guatemala, Guinea, Guinea Bissau, Guyana, Haiti, Honduras, Hong Kong, Hungary, Iceland, India, Indonesia, Iran, Iraq, Ireland, Israel, Italy, Jamaica, Japan, Jordan, Kazakhstan, Kenya, Kiribati, Kosovo, Kuwait, Kyrgz Republic (Kyrgyzstan), Laos, Latvia, Lebanon, Lesotho, Liberia, Libya, Liechtenstein, Lithuania, Luxembourg, Madagascar, Malawi, Malaysia, Maldives, Mali, Malta, Marshall Islands, Mauritania, Mauritius, Mexico, Micronesia, Federated States, Moldova, Monaco, Mongolia, Montenegro, Morocco, Mozambique, Multinational/Crossnational, Myanmar (Burma), Namibia, Nauru, Nepal, Netherlands, New Zealand, Nicaragua, Niger, Nigeria, North Korea, North Macedonia, Norway, Oman, Pakistan, Palau, Palestine, Panama, Papua New Guinea, Paraguay, Peru, Philippines, Poland, Portugal, Qatar, Republic of Congo, Romania, Russia, Rwanda, Saint Kitts and Nevis, Saint Lucia, Saint Vincent and the Grenadines, Samoa, San Marino, Sao Tome and Principe, Saudi Arabia, Senegal, Serbia, Seychelles, Sierra Leone, Singapore, Slovakia, Slovenia, Solomon Islands, Somalia, South Africa, South Korea, South Sudan, Spain, Sri Lanka, Sudan, Suriname, Sweden, Switzerland, Syria, Taiwan, Tajikstan, Tanzania, Thailand, Timor Leste, Togo, Tonga, Trinidad and Tobago, Tunisia, Turkey, Turkmenistan, Tuvalu, Uganda, Ukraine, United Arab Emirates, United Kingdom, United States, Uruguay, Uzbekistan, Vanuatu, Venezuela, Vietnam, Yemen, Zambia, and Zimbabwe
- Title
- Global Synthetic Dataset
- Format
- Data archive or collection