The tool for providing anonymous data in response to a request for information is configured for: receiving a request for information relating to a population of interest from a user; determining criteria associated with the population of interest, wherein the criteria define an individual representative of the population of interest; determining a characteristic of the individual within one or more categories; classifying the characteristic of the individual into a cluster, the cluster comprising a subset of all possible outcomes for the category; classifying a plurality of individuals from a database into clusters based on the one or more categories; determining a subset of the plurality of individuals that are similar to the individual representative of the population of interest based on the clusters; transforming the characteristics of the individuals in the subset to generate anonymous data responsive to the request for information; and providing the anonymous data to the user.
Legal claims defining the scope of protection. Each claim is shown in both the original legal language and a plain English translation.
1. A system for providing anonymous data in response to a request for information, the system comprising: a memory device having computer readable program code store thereon; and a processing device operatively coupled to the memory device, wherein the processing device is configured to execute the computer readable program code to: receive a request for information relating to a population of interest from a user; determine criteria associated with the population of interest, wherein the criteria defines a first individual representative of the population of interest; determine a characteristic of the first individual within one or more categories, wherein the characteristic of the first individual is based on data associated with the first individual; determine multiple clusters, wherein each cluster comprises a subset of all possible outcomes for one of the categories, wherein the cluster comprises a group of values for an age and location of the population of interest; classify the first individual into one of the clusters, wherein classifying the first individual is based on the characteristic of the first individual within the one or more categories; classify other individuals from a database into one of the clusters, wherein classifying each of the other individuals is based on a characteristic of the other individual within the one or more categories; order the multiple clusters into magnitudes from smallest to largest magnitude to generate a harp-shaped distribution presentation; determine a subset of the other individuals that are similar to the first individual, wherein the determination is based on the distribution of other individuals among the clusters; transform the characteristics of the subset of the other individuals to generate anonymous data responsive to the request for information, wherein the anonymous data is based on the distribution of the subset of the other the individuals among the clusters; and provide the anonymous data to the user.
A system provides anonymous data in response to a request for information. The system receives a request from a user for information about a population of interest and determines criteria that define a representative individual for that population. It determines that individual's characteristics within one or more categories, then defines multiple clusters, each covering a subset of the possible outcomes for a category, such as a group of age and location values. It classifies the representative individual and other individuals from a database into these clusters based on their characteristics, and orders the clusters from smallest to largest magnitude to produce a harp-shaped distribution presentation. Based on how the other individuals are distributed among the clusters, it identifies the subset similar to the representative individual, transforms that subset's characteristics into anonymous data, and provides the anonymous data to the user.
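The flow of claim 1 can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the record fields, the ten-year age bands, the use of head count as the magnitude, and the choice to treat "similar" as "falls in the same cluster as the representative individual".

```python
from collections import defaultdict
from statistics import mean

# Hypothetical records; field names and values are illustrative only.
DATABASE = [
    {"age": 34, "location": "NC", "balance": 1200.0},
    {"age": 37, "location": "NC", "balance": 1800.0},
    {"age": 31, "location": "NC", "balance": 900.0},
    {"age": 52, "location": "NY", "balance": 4000.0},
    {"age": 58, "location": "NY", "balance": 5200.0},
    {"age": 23, "location": "NC", "balance": 300.0},
]

def cluster_key(person, age_band=10):
    """Map a person to a cluster: an age band plus a location (assumed scheme)."""
    low = (person["age"] // age_band) * age_band
    return (low, low + age_band - 1, person["location"])

def anonymous_answer(representative, database, field):
    """Return the ordered clusters and an aggregate for the representative's cluster."""
    # Classify every database individual into a cluster.
    clusters = defaultdict(list)
    for person in database:
        clusters[cluster_key(person)].append(person)
    # Order clusters from smallest to largest magnitude (here: head count).
    ordered = sorted(clusters.items(), key=lambda item: len(item[1]))
    # Assumed similarity rule: same cluster as the representative individual.
    subset = clusters.get(cluster_key(representative), [])
    if not subset:
        return ordered, None
    # Transform individual values into an aggregate that names no one.
    summary = {"count": len(subset), "average": mean(p[field] for p in subset)}
    return ordered, summary
```

Calling `anonymous_answer({"age": 35, "location": "NC"}, DATABASE, "balance")` returns the clusters ordered by size together with a count and an average for the 30-39 / NC cluster, so the user sees only aggregate values rather than any single record.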
2. The system of claim 1, wherein the processing device is further configured to execute the computer readable program code to: order the clusters of individuals based on a magnitude of each cluster and a distribution selected from the group consisting of a normal distribution, an exponential distribution, and a Poisson distribution; and determine a function that represents the individuals in the subset using a zeta function.
Building on the system of claim 1, the clusters of individuals are ordered based on the magnitude of each cluster and on a distribution chosen from a normal, an exponential, or a Poisson distribution. The system also uses a zeta function to determine a function that represents the individuals in the subset used to generate the anonymous data.
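The claim does not say how the zeta function is applied. One possible reading, offered here purely as an assumption, is a zeta (Zipf) distribution over cluster ranks, where the cluster of rank k receives probability k^(-s) / zeta(s). The sketch below approximates zeta(s) with a truncated series; the function names, the exponent `s`, and the rank convention are all hypothetical.

```python
def zeta(s, terms=100_000):
    """Approximate the Riemann zeta function for s > 1 by a truncated series."""
    if s <= 1:
        raise ValueError("the series only converges for s > 1")
    return sum(k ** -s for k in range(1, terms + 1))

def zeta_pmf(rank, s):
    """Zeta (Zipf) probability for a cluster rank, with rank 1 the most likely."""
    if rank < 1:
        raise ValueError("rank must be a positive integer")
    return rank ** -s / zeta(s)
```

Under this reading, a larger exponent `s` concentrates more of the probability on the top-ranked clusters, and a value closer to 1 spreads it across more clusters.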
3. The system of claim 1, wherein the processing device is further configured to execute the computer readable program code to: determine the characteristic of the first individual within one or more categories by: identifying categories associated with the request for information; accessing a database comprising information associated with the population of interest; and determining the characteristic by comparing the categories associated with the request for information with the information stored in the database.
In the system for providing anonymous data, the determination of an individual's characteristics involves identifying relevant categories associated with the information request. The system accesses a database containing information about the population of interest and determines the individual's characteristics by comparing the request categories with the information stored in the database.
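The comparison step can be pictured as a lookup of the requested categories against a stored record. This is a minimal sketch under assumed names: the function `characteristics_for`, the category labels, and the record layout are illustrative and do not come from the patent.

```python
def characteristics_for(request_categories, record):
    """Keep only the stored fields that match a category named in the request."""
    return {c: record[c] for c in request_categories if c in record}

# Hypothetical request and database record.
requested = ["age", "location", "income"]
stored = {"age": 35, "location": "NC", "name": "example"}
matched = characteristics_for(requested, stored)
```

Here `matched` holds only the age and location entries: the income category has no stored value, and the name field was never requested.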
4. The system of claim 1, wherein the processing device is further configured to execute the computer readable program code to: determine a magnitude of a cluster based on at least one of a number of individuals in the cluster and a transformation based on the number of individuals in the cluster and a value of the characteristic for each individual.
Within the system for generating anonymous data, the magnitude of a cluster is determined by at least one of: the number of individuals in the cluster, or a transformation based on both the number of individuals and the characteristic value for each individual in that cluster.
5. The system of claim 4, wherein the transformation is a product of the number of individuals in the cluster and the value of the characteristic for each individual.
In the system for generating anonymous data, the transformation for determining cluster magnitude (based on both the number of individuals and characteristic value) is specifically calculated as the product of the number of individuals in the cluster and the value of the characteristic for each of those individuals.
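The two magnitude options in claims 4 and 5 can be compared with a short sketch. It assumes that every individual in a cluster shares one characteristic value, which the claims do not state; the cluster labels, member ids, and values are hypothetical.

```python
# Hypothetical clusters: label -> (member ids, characteristic value shared by the cluster).
CLUSTERS = {
    "age 30-39": (["a", "b", "c"], 35),
    "age 50-59": (["d", "e"], 55),
}

def magnitude_by_count(members):
    """Magnitude as the number of individuals in the cluster."""
    return len(members)

def magnitude_by_product(members, value):
    """Magnitude as head count times the cluster's characteristic value."""
    return len(members) * value

def order_by_count(clusters):
    """Cluster labels from smallest to largest head count."""
    return sorted(clusters, key=lambda label: magnitude_by_count(clusters[label][0]))

def order_by_product(clusters):
    """Cluster labels from smallest to largest count-times-value product."""
    return sorted(clusters, key=lambda label: magnitude_by_product(*clusters[label]))
```

With this data the two definitions disagree: by head count the 50-59 cluster is smaller (2 versus 3), while by product the 30-39 cluster is smaller (105 versus 110), so the choice of magnitude changes the ordering.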
6. The system of claim 1, wherein the processing device is further configured to execute the computer readable program code to transform the characteristics of the individuals in the subset of the other individuals to generate anonymous data responsive to the request for information by determining an average for the subset of the other individuals.
In the system for generating anonymous data, the transformation of characteristics of the subset of individuals, used to generate the anonymous data, is achieved by calculating an average value for the characteristics of the individuals within that subset.
7. The system of claim 1, wherein the processing device is further configured to execute the computer readable program code to: identify a plurality of characteristics of the first individual; classify a subset of the other individuals into clusters based on the plurality of characteristics; and determine a maximum number of characteristics, wherein the maximum number of characteristics is determined based on a predetermined minimum permissible number of individuals in the subset.
Building on the system of claim 1, the system identifies multiple characteristics of the representative individual and classifies a subset of the other individuals into clusters based on those characteristics. It then determines the maximum number of characteristics to use, based on a predetermined minimum permissible number of individuals in the subset. This caps how many characteristics are considered, since each added characteristic can only shrink the matching subset.
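One way to picture the cap on characteristics is a greedy loop that adds one characteristic at a time and stops before the matching subset falls below the minimum size. The greedy order, the exact-match rule, and all names and records below are assumptions for illustration; the claim only requires that the maximum depend on the minimum permissible subset size.

```python
def max_characteristics(representative, database, ordered_fields, min_subset):
    """Count how many leading fields can be matched before the subset gets too small."""
    subset = list(database)
    used = 0
    for field in ordered_fields:
        # Keep only individuals who match the representative on this field.
        narrowed = [p for p in subset if p.get(field) == representative.get(field)]
        if len(narrowed) < min_subset:
            break  # adding this field would violate the minimum subset size
        subset, used = narrowed, used + 1
    return used, subset

# Hypothetical representative individual and database.
REP = {"age_band": "30s", "location": "NC", "job": "nurse"}
PEOPLE = [
    {"age_band": "30s", "location": "NC", "job": "nurse"},
    {"age_band": "30s", "location": "NC", "job": "teacher"},
    {"age_band": "30s", "location": "NC", "job": "clerk"},
    {"age_band": "30s", "location": "NY", "job": "nurse"},
    {"age_band": "50s", "location": "NC", "job": "nurse"},
]
used, subset = max_characteristics(REP, PEOPLE, ["age_band", "location", "job"], 3)
```

With a minimum of three individuals, age band and location can both be used, but adding the job field would leave a single match, so the loop stops at two characteristics.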
8. A computer program product for providing anonymous data in response to a request for information, the computer program product comprising at least one non-transitory computer-readable medium having computer-readable program code portions embodied therein, the computer-readable program code portions comprising: an executable portion configured to receive a request for information relating to a population of interest from a user; an executable portion configured to determine criteria associated with the population of interest, wherein the criteria define a first individual representative of the population of interest; an executable portion configured to determine a characteristic of the first individual within one or more categories, wherein the characteristic of the first individual is based on data associated with the first individual; an executable portion configured to determine multiple clusters, wherein each cluster comprises a subset of all possible outcomes for one of the categories, wherein the cluster comprises a group of values for an age and location of the population of interest; an executable portion configured to classify the first individual into one of the clusters, wherein classifying the first individual is based on the characteristic of the first individual within the one or more categories; an executable portion configured to classify other individuals from a database into one the clusters, wherein classifying each of the other individuals is based on a characteristic of the other individual within the one or more categories; an executable portion configured to order the multiple clusters into magnitudes from smallest to largest magnitude to generate a harp-shaped distribution presentation; an executable portion configured to determine a subset of the other individuals that are similar to the first individual, wherein the determination is based on the distribution of other individuals among the clusters; an executable portion configured to transform the characteristics of the subset of the other individuals to generate anonymous data responsive to the request for information, wherein the anonymous data is based on the distribution of the subset of the other the individuals among the clusters; and an executable portion configured to provide the anonymous data to the user.
A computer program product provides anonymous data in response to a request for information. The program receives a request from a user for information about a population of interest and determines criteria that define a representative individual for that population. It determines that individual's characteristics within one or more categories, then defines multiple clusters, each covering a subset of the possible outcomes for a category, such as a group of age and location values. It classifies the representative individual and other individuals from a database into these clusters based on their characteristics, and orders the clusters from smallest to largest magnitude to produce a harp-shaped distribution presentation. Based on how the other individuals are distributed among the clusters, it identifies the subset similar to the representative individual, transforms that subset's characteristics into anonymous data, and provides the anonymous data to the user.
9. The computer program product of claim 8, wherein the computer-readable program code portions further comprise: an executable portion configured to order the clusters of individuals based on a magnitude of each cluster and a distribution selected from the group consisting of a normal distribution, an exponential distribution, and a Poisson distribution; and an executable portion configured to determine a function that represents the individuals in the subset using a zeta function.
Building on the computer program product of claim 8, the clusters of individuals are ordered based on the magnitude of each cluster and on a distribution chosen from a normal, an exponential, or a Poisson distribution. The program also uses a zeta function to determine a function that represents the individuals in the subset used to generate the anonymous data.
10. The computer program product of claim 8, wherein the computer-readable program code portions further comprise: an executable portion configured to determine the characteristic of the first individual within one or more categories by: identifying categories associated with the request for information; accessing a database comprising information associated with the population of interest; and determining the characteristic by comparing the categories associated with the request for information with the information stored in the database.
In the computer program for providing anonymous data, the determination of an individual's characteristics involves identifying relevant categories associated with the information request. The system accesses a database containing information about the population of interest and determines the individual's characteristics by comparing the request categories with the information stored in the database.
11. The computer program product of claim 8, wherein the computer-readable program code portions further comprise: an executable portion configured to determine a magnitude of a cluster based on at least one of a number of individuals in the cluster and a transformation based on the number of individuals in the cluster and a value of the characteristic for each individual.
Within the computer program for generating anonymous data, the magnitude of a cluster is determined by at least one of: the number of individuals in the cluster, or a transformation based on both the number of individuals and the characteristic value for each individual in that cluster.
12. The computer program product of claim 11, wherein the transformation is a product of the number of individuals in the cluster and the value of the characteristic for each individual.
In the computer program for generating anonymous data, the transformation for determining cluster magnitude (based on both the number of individuals and characteristic value) is specifically calculated as the product of the number of individuals in the cluster and the value of the characteristic for each of those individuals.
13. The computer program product of claim 8, wherein the computer-readable program code portions further comprise: an executable portion configured to transform the characteristics of the individuals in the subset of the other individuals to generate anonymous data responsive to the request for information by determining an average for the subset of the other individuals.
In the computer program for generating anonymous data, the transformation of characteristics of the subset of individuals, used to generate the anonymous data, is achieved by calculating an average value for the characteristics of the individuals within that subset.
14. The computer program product of claim 8, wherein the computer-readable program code portions further comprise: an executable portion configured to identify a plurality of characteristics of the first individual; an executable portion configured to classify a subset of the other individuals into clusters based on the plurality of characteristics; and an executable portion configured to determine a maximum number of characteristics, wherein the maximum number of characteristics is determined based on a predetermined minimum permissible number of individuals in the subset.
Building on the computer program product of claim 8, the program identifies multiple characteristics of the representative individual and classifies a subset of the other individuals into clusters based on those characteristics. It then determines the maximum number of characteristics to use, based on a predetermined minimum permissible number of individuals in the subset. This caps how many characteristics are considered, since each added characteristic can only shrink the matching subset.
15. A method for providing anonymous data in response to a request for information, the method comprising: receiving, by a processing device, a request for information relating to a population of interest from a user; determining, by the processing device, criteria associated with the population of interest, wherein the criteria define a first individual representative of the population of interest; determining, by the processing device, a characteristic of the first individual within one or more categories, wherein the characteristic of the first individual is based on data associated with the first individual; determining multiple clusters, wherein each cluster comprises a subset of all possible outcomes for one of the categories, wherein the cluster comprises a group of values for an age and location of the population of interest; classifying, by the processing device, the first individual into one of the clusters, wherein classifying the first individual is based on the characteristic of the first individual within the one or more categories; classifying, by the processing device, other individuals from a database into one of the clusters, wherein classifying each of the other individuals is based on a characteristic of the other individual within the one or more categories; ordering the multiple clusters into magnitudes from smallest to largest magnitude to generate a harp-shaped distribution presentation; determining, by the processing device, a subset of the other individuals that are similar to the first individual, wherein the determination is based on the distribution of other individuals among the clusters; transforming, by the processing device, the characteristics of the subset of the other individuals to generate anonymous data responsive to the request for information, wherein the anonymous data is based on the distribution of the subset of the other the individuals among the clusters; and providing, by the processing device, the anonymous data to the user.
A method provides anonymous data in response to a request for information. A processing device receives a request from a user for information about a population of interest and determines criteria that define a representative individual for that population. It determines that individual's characteristics within one or more categories, then defines multiple clusters, each covering a subset of the possible outcomes for a category, such as a group of age and location values. It classifies the representative individual and other individuals from a database into these clusters based on their characteristics, and orders the clusters from smallest to largest magnitude to produce a harp-shaped distribution presentation. Based on how the other individuals are distributed among the clusters, it identifies the subset similar to the representative individual, transforms that subset's characteristics into anonymous data, and provides the anonymous data to the user.
16. The method of claim 15, wherein the method further comprises: ordering the clusters of individuals based on a magnitude of each cluster and a distribution selected from the group consisting of a normal distribution, an exponential distribution, and a Poisson distribution; and determining a function that represents the individuals in the subset using a zeta function.
Building on the method of claim 15, the clusters of individuals are ordered based on the magnitude of each cluster and on a distribution chosen from a normal, an exponential, or a Poisson distribution. The method also uses a zeta function to determine a function that represents the individuals in the subset used to generate the anonymous data.
17. The method of 15, wherein the method further comprises determining the characteristic of the first individual within one or more categories by: identifying categories associated with the request for information; accessing a database comprising information associated with the population of interest; and determining the characteristic by comparing the categories associated with the request for information with the information stored in the database.
In the method for providing anonymous data, the determination of an individual's characteristics involves identifying relevant categories associated with the information request. The system accesses a database containing information about the population of interest and determines the individual's characteristics by comparing the request categories with the information stored in the database.
18. The method of 15, wherein the method further comprises: determining a magnitude of a cluster based on at least one of a number of individuals in the cluster and a transformation based on the number of individuals in the cluster and a value of the characteristic for each individual.
Within the method for generating anonymous data, the magnitude of a cluster is determined by at least one of: the number of individuals in the cluster, or a transformation based on both the number of individuals and the characteristic value for each individual in that cluster.
19. The method of 18, wherein the transformation is a product of the number of individuals in the cluster and the value of the characteristic for each individual.
In the method for generating anonymous data, the transformation for determining cluster magnitude (based on both the number of individuals and characteristic value) is specifically calculated as the product of the number of individuals in the cluster and the value of the characteristic for each of those individuals.
20. The method of claim 15, wherein the method further comprises transforming the characteristics of the individuals in the subset of the other individuals to generate anonymous data responsive to the request for information by determining an average for the subset of the other individuals.
In the method for generating anonymous data, the transformation of characteristics of the subset of individuals, used to generate the anonymous data, is achieved by calculating an average value for the characteristics of the individuals within that subset.
February 5, 2015
May 30, 2017