Lost in Translation: Improving Decoy Documents via Automated Translation Voris Jonathan A. author Columbia University. Computer Science Boggs Nathaniel Gordon author Columbia University. Computer Science Stolfo Salvatore author Columbia University. Computer Science Columbia University. Computer Science originator text Articles 2012 manuscript version English Detecting insider attacks continues to prove to be one of the most difficult challenges in securing sensitive data. Decoy information and documents represent a promising approach to detecting malicious masqueraders, however, false positives can interfere with legitimate work and take up user time. We propose generating foreign language decoy documents that are sprinkled with untranslatable enticing proper nouns such as company names, hot topics, or apparent login information. Our goal is for this type of decoy to serve three main purposes. First, using a language that is not used in normal business practice gives real users a clear signal that the document is fake, so they waste less time examining it. Second, an attacker, if enticed, will need to exfiltrate the document's contents in order to translate it, providing a cleaner signal of malicious activity. Third, we consume significant adversarial resources as they must still read the document and decide if it contains valuable information, which is made more difficult as it will be somewhat scrambled through translation. In this paper, we expand upon the rationale behind using foreign language decoys. We present a preliminary evaluation which shows how they significantly increase the cost to attackers in terms of the amount of time that it takes to determine if a document is real and potentially contains valuable information or is entirely bogus, confounding their goal of exfiltrating important sensitive information. Computer science IEEE CS Security and Privacy Workshops: SPW 2012: Proceedings: 24-25 May 2012, San Francisco, California, USA Los Alamitos, Calif. IEEE 2012 129 133 http://dx.doi.org/10.1109/SPW.2012.20 </titleInfo> </relatedItem> </relatedItem> <identifier type="hdl">http://hdl.handle.net/10022/AC:P:14892</identifier> <location> <physicalLocation authority="marcorg">NNC</physicalLocation> </location> <recordInfo> <recordContentSource authority="marcorg">NNC</recordContentSource> <recordCreationDate encoding="w3cdtf">2012-10-11 14:46:17 -0400</recordCreationDate> <recordChangeDate encoding="w3cdtf">2012-10-11 14:53:10 -0400</recordChangeDate> <recordIdentifier>8896</recordIdentifier> <languageOfCataloging> <languageTerm authority="iso639-2b">eng</languageTerm> </languageOfCataloging> </recordInfo> </mods>