Microsoft’s AI research team, while publishing a bucket of open-source training data on GitHub, accidentally exposed 38 terabytes of additional private data — including a disk backup of two employees’ workstations.

The backup includes secrets, private keys, passwords, and over 30,000 internal Microsoft Teams messages.

The researchers shared their files using an Azure feature called SAS tokens, which allows you to share data from Azure Storage accounts.

The access level can be limited to specific files only; however, in this case, the link was configured to share the entire storage account — including another 38TB of private files.