Patentable/Patents/US-12008334

US-12008334

Secure translation of sensitive content

PublishedJune 11, 2024

Assigneenot available in USPTO data we have

Inventorsnot available in USPTO data we have

Technical Abstract

Methods and systems for secure translation of sensitive content are described herein. In the method, content of a file may be segmented into a plurality of sections of text. At least one section of text includes an item of sensitive content and items of nonsensitive content. The item of sensitive content may be replaced with replacement content, which enables translation of the at least one section of text without use of the sensitive content. The plurality of sections of text may be sent to remote computing devices for translation. After translation, the translation of the at least one section of text received from the remote computing device may be modified to include the item of sensitive content instead of the replacement content. A translation of the content of the file may be generated based on translations of the plurality of sections of text received from the remote computing devices.

Patent Claims

7 claims

Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.

Claim 4

Original Legal Text

4. The method of claim 1, wherein the item of sensitive content comprises a date, a number, a price, or a name of a company.

Plain English Translation

This invention relates to systems for detecting and processing sensitive content in digital documents. The problem addressed is the need to identify and handle specific types of sensitive information, such as dates, numbers, prices, or company names, within digital content to ensure compliance with privacy regulations or security policies. The method involves analyzing digital documents to detect sensitive content, which may include dates, numerical values, monetary amounts, or company names. Once identified, the system processes this content to either redact, encrypt, or flag it for further review. The detection process uses pattern recognition, natural language processing, or machine learning techniques to accurately identify the sensitive information. The system can be configured to handle different types of sensitive data based on predefined rules or user-defined criteria. After processing, the modified document is stored or transmitted securely, ensuring that sensitive information is protected from unauthorized access. This approach helps organizations comply with data protection laws and prevent leaks of confidential information.

Claim 5

Original Legal Text

5. The method of claim 1, further comprising identifying, by the computing device, the at least one section of text including an item of sensitive content.

Plain English Translation

This invention relates to a method for processing text data to identify and handle sensitive content within a computing system. The method involves analyzing text data to detect sections containing sensitive information, such as personal, financial, or confidential details. The system uses natural language processing (NLP) techniques to scan the text and flag sections where sensitive content is present. Once identified, these sections can be redacted, encrypted, or otherwise secured to prevent unauthorized access. The method ensures compliance with privacy regulations and protects sensitive data from exposure. The system may also categorize the type of sensitive content detected, such as personally identifiable information (PII), financial records, or proprietary data, to apply appropriate security measures. The approach enhances data security by automating the detection and handling of sensitive information in digital documents, databases, or communication channels. This method is particularly useful in industries like healthcare, finance, and legal services where data privacy is critical. The system may integrate with existing security protocols to provide a comprehensive solution for managing sensitive content.

Claim 7

Original Legal Text

7. The method of claim 1, wherein the segmenting further comprises segmenting the content of the file into the plurality of sections of text based on a length of the content.

Plain English Translation

This invention relates to a method for segmenting digital content, particularly text-based files, into multiple sections based on content length. The method addresses the challenge of efficiently dividing large text files into manageable portions for processing, analysis, or storage, ensuring consistency and usability across different applications. The method involves analyzing the content of a file to determine its total length, then dividing it into multiple sections of predefined or dynamically determined lengths. This segmentation ensures that each section is of a suitable size for subsequent operations, such as indexing, searching, or machine learning tasks. The segmentation process may also account for natural breaks in the text, such as paragraphs or chapters, to maintain logical coherence within each section. Additionally, the method may include metadata generation for each section, allowing for easier retrieval and organization. The segmented sections can then be stored, transmitted, or processed independently, improving efficiency and scalability in systems handling large text datasets. This approach is particularly useful in applications like document management, natural language processing, and data archiving, where dividing content into structured, length-based segments enhances performance and usability.

Claim 8

Original Legal Text

8. The method of claim 1, wherein the order of combining the translations of the plurality of sections of text received from the remote computing devices is based on an original order of the sections of text in the file.

Plain English Translation

This invention relates to distributed text translation systems, specifically methods for combining translated sections of text from multiple remote computing devices. The problem addressed is ensuring accurate reconstruction of the original document structure after distributed translation, where different sections may be processed independently by separate devices. The method involves receiving translations of multiple text sections from remote computing devices and combining them in the same order as the original sections appeared in the file. This preserves the logical flow and coherence of the translated document. The system first identifies the original order of text sections in the input file before distribution for translation. Each remote device processes a distinct section of the text and returns the translated version. The central system then reassembles these translations by matching them to their original positions in the document. This approach ensures that translated content maintains the same sequence as the source material, which is particularly important for technical documents, legal texts, or any content where section order conveys meaning. The method may also include error handling to detect and correct mismatches between translated sections and their original positions. The invention improves upon prior systems that may have combined translations arbitrarily or required manual reordering.

Claim 12

Original Legal Text

12. The apparatus of claim 9, wherein the item of sensitive content comprises a date, a number, a price, or a name of a company.

Plain English Translation

This invention relates to systems for detecting and handling sensitive content in digital documents. The problem addressed is the need to identify and manage specific types of sensitive information, such as dates, numbers, prices, or company names, within electronic documents to ensure compliance with privacy or security regulations. The apparatus includes a processing system configured to analyze digital documents for sensitive content. The system scans the documents to detect predefined types of sensitive information, such as dates, numerical values, prices, or company names. Once identified, the system can flag, redact, or otherwise process the sensitive content to prevent unauthorized access or disclosure. The apparatus may also include a user interface for configuring detection rules or reviewing flagged content. The detection process involves pattern recognition, contextual analysis, or machine learning techniques to accurately identify sensitive data within text, tables, or other document structures. The system may also integrate with external databases or knowledge bases to verify the sensitivity of detected information. For example, it can cross-reference detected company names with a list of regulated entities or validate numerical values against known price ranges. The apparatus ensures that sensitive information is handled according to predefined policies, such as encryption, access restrictions, or automated redaction, reducing the risk of data breaches or compliance violations. The system is designed to operate across various document formats, including PDFs, spreadsheets, and text files, making it adaptable to different enterprise environments.

Claim 18

Original Legal Text

18. The one or more non-transitory computer readable media of claim 15, wherein the item of sensitive content comprises a date, a number, a price, or a name of a company.

Plain English Translation

This invention relates to systems for detecting and handling sensitive content in digital documents. The problem addressed is the need to identify and manage sensitive information such as dates, numbers, prices, or company names within electronic documents to prevent unauthorized disclosure or misuse. The solution involves a computer-implemented method that processes digital documents to detect such sensitive content and applies predefined rules or policies to determine appropriate actions, such as redaction, encryption, or access restrictions. The system includes a content analysis module that scans documents for specific patterns or keywords corresponding to sensitive data types. For example, it may identify numerical values within a certain range as prices or detect alphanumeric strings matching known company names. Once identified, the sensitive content is flagged for further processing. The system then applies predefined rules to determine the appropriate handling action based on the context and type of sensitive information detected. These actions may include automatically redacting the content, encrypting it, or restricting access to authorized users only. The invention also includes a user interface for configuring the detection rules and handling policies, allowing administrators to customize the system for different types of documents and compliance requirements. The system may operate in real-time or batch processing modes, depending on the deployment scenario. By automating the detection and handling of sensitive content, the invention reduces the risk of data breaches and ensures compliance with regulatory standards.

Claim 20

Original Legal Text

20. The one or more non-transitory computer readable media of claim 15, wherein the order of combining the translations of the plurality of sections of text received from the remote computing devices is based on an original order of the sections of text in the file.

Plain English Translation

This invention relates to distributed text processing systems, specifically methods for combining translated text sections from multiple remote computing devices. The problem addressed is ensuring accurate reconstruction of translated documents when text is divided into sections and processed in parallel across different devices. The solution involves a system that receives translations of text sections from remote devices and combines them in the original order of the sections within the source file. This maintains the logical structure and coherence of the translated document. The system may also include a central server that coordinates the distribution of text sections to remote devices, receives the translated sections, and reassembles them in the correct sequence. The method ensures that translations are combined in the same order as the original text sections, preserving context and readability. This approach is particularly useful in large-scale translation tasks where processing is distributed across multiple devices to improve efficiency. The invention may also include error handling mechanisms to address missing or out-of-order translations. The system can be applied to various text processing applications, including document translation, data analysis, and content management.

Classification Codes (CPC)

Cooperative Patent Classification codes for this invention. Click any code to explore related patents in that topic.

G06F

Patent Metadata

Filing Date

July 16, 2020

Publication Date

June 11, 2024

Want to explore more patents?

Browse 5M+ US patents with plain-English claim translations and AI-generated analysis.

Browse All Patents Try Prior Art Search