DeepSeek 'Bug' Misunderstood as Privacy Breach, Actually Training Data Extraction

A recent social media claim suggesting that DeepSeek's chat interface allows users to access others' historical conversations has been debunked. The issue, initially labeled as a P0-level multi-tenant isolation failure, caused widespread concern over potential privacy breaches. However, the phenomenon is actually a result of training data extraction, where the model generates content resembling real dialogue based on its training data and current system prompts, not from actual user conversations. This behavior is common among large models and not unique to DeepSeek. Studies, including one by Google DeepMind in 2023, have shown that special inputs can trigger models to produce outputs from their training data. The inclusion of today's date in the generated content is due to the system prompt, not evidence of real-time data leakage. No proof has been found to confirm any multi-tenant isolation failure or that the outputs belong to specific users.

出典: 原文を表示

免責事項: Phemexニュースで提供されるコンテンツは、あくまで情報提供を目的としたものであり、第三者の記事から取得した情報の正確性・完全性・信頼性について保証するものではありません。本コンテンツは金融または投資の助言を目的としたものではなく、投資に関する最終判断はご自身での調査と、信頼できる専門家への相談を踏まえて行ってください。

​​こちらもおすすめ​​

こちらもおすすめ