

Data-Centric Systems and Applications: Data Matching: Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection (Hardcover)
Specs
- Book format: Hardcover
- Fiction/nonfiction: Non-Fiction
- Genre: Computing & Internet
- Publication date: July 2012
- Pages: 272
- Edition: 2012 ed.
About this item
Product details
Data matching (also known as record or data linkage, entity resolution, object identification, or field matching) is the task of identifying, matching, and merging records that correspond to the same entities from several databases or even within one database. Based on research in various domains including applied statistics, health informatics, data mining, machine learning, artificial intelligence, database management, and digital libraries, significant advances have been achieved over the last decade in all aspects of the data matching process, especially in improving the accuracy of data matching and its scalability to large databases.
Peter Christen's book is divided into three parts. Part I, "Overview", introduces the subject by presenting several sample applications and their special challenges, as well as a general overview of a generic data matching process. Part II, "Steps of the Data Matching Process", then details its main steps, such as pre-processing, indexing, field and record comparison, classification, and quality evaluation. Part III, "Further Topics", deals with specific aspects such as privacy, real-time matching, and matching unstructured data. Finally, the book briefly describes the main features of many research and open source systems available today.
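To give a feel for the steps named above, here is a minimal Python sketch of a toy matching pipeline: pre-processing, indexing (blocking), field comparison, and threshold classification. It is not taken from the book. The field names (`given`, `surname`, `city`), the blocking key, the use of `difflib` for string similarity, and the 0.85 threshold are all assumptions chosen for illustration.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def preprocess(record):
    # Pre-processing: lowercase every field and collapse whitespace.
    return {k: " ".join(str(v).lower().split()) for k, v in record.items()}


def block_key(record):
    # Indexing: hypothetical blocking key, the first letter of the surname.
    return record["surname"][:1]


def candidate_pairs(records):
    # Only records that share a blocking key are compared with each other.
    blocks = defaultdict(list)
    for idx, rec in enumerate(records):
        blocks[block_key(rec)].append(idx)
    for ids in blocks.values():
        yield from combinations(ids, 2)


def similarity(a, b, fields=("given", "surname", "city")):
    # Comparison: average approximate string similarity over the fields.
    scores = [SequenceMatcher(None, a[f], b[f]).ratio() for f in fields]
    return sum(scores) / len(scores)


def match(records, threshold=0.85):
    # Classification: a pair is a match if its score reaches the
    # (assumed) threshold.
    cleaned = [preprocess(r) for r in records]
    return [
        (i, j)
        for i, j in candidate_pairs(cleaned)
        if similarity(cleaned[i], cleaned[j]) >= threshold
    ]


records = [
    {"given": "Peter", "surname": "Smith", "city": "Canberra"},
    {"given": "peter ", "surname": "Smyth", "city": "Canberra"},
    {"given": "Mary", "surname": "Jones", "city": "Sydney"},
]
print(match(records))  # → [(0, 1)]
```

Blocking keeps the number of comparisons manageable on large inputs, at the cost of missing true matches whose blocking keys differ. That is one reason an off-the-shelf configuration usually needs tuning for each data set.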
By providing the reader with a broad range of data matching concepts and techniques and touching on all aspects of the data matching process, this book helps researchers as well as students specializing in data quality or data matching to familiarize themselves with recent research advances and to identify open research challenges in the area of data matching. To this end, each chapter of the book includes a final section that provides pointers to further background and research material. Practitioners will better understand the current state of the art in data matching as well as the internal workings and limitations of current systems. In particular, they will learn that it is often not feasible to simply implement an existing off-the-shelf data matching system without substantial adaptation and customization. Such practical considerations are discussed for each of the major steps in the data matching process.
Similar items you might like
Based on what customers bought
Linking Sensitive Data: Methods and Techniques for Practical Privacy-Preserving Information Sharing (Hardcover) $140.85
Data Handling in Science and Technology Resolving Spectral Mixtures: With Applications from Ultrafast Time-Resolved Spectroscopy to Super-Resolution Imaging Vol, Book 30 (Hardcover) $225.83
Intelligent Systems Reference Library Data Mining: Concepts, Models and Techniques, Book 12 (Hardcover) $163.87
Studies in Fuzziness and Soft Computing Grade Models and Methods for Data Analysis: With Applications for the Analysis of Data Populations, Book 151 (Hardcover) $211.15
Advances in Database Systems Privacy-Preserving Data Mining: Models and Algorithms, Book 34 (Hardcover) $184.90
International Operations Research & Mana Models, Methods, Concepts & Applications of the Analytic Hierarchy Process, Book 175 (Hardcover) $192.19
Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management (Hardcover) $183.99
Data Preprocessing, Active Learning, and Cost Perceptive Approaches for Resolving Data Imbalance (Paperback) $190.00
Premier Reference Source Pattern Recognition and Signal Processing in Archaeometry: Mathematical and Computational Solutions for Archaeology (Hardcover) $200.76
Adaptive and Cognitive Dynamic Systems: Handbook on Array Processing and Sensor Networks, Book 60 (Hardcover) $230.42 (was $244.50)
Beyond Measure Renewing Your Mind God's Way, Book 1 (Paperback) $13.12
International Intelligent Technologies Traffic Control and Transport Planning: A Fuzzy Sets and Neural Networks Approach, Book 13 (Hardcover) $171.17
Implementing Computational Intelligence Techniques for Security Systems Design (Hardcover) $188.45
Digital Communications with Emphasis on Data Modems: Theory, Analysis, Design, Simulation, Testing, and Applications (Hardcover) $172.21
The Functional Approach to Data Management: Modeling, Analyzing and Integrating Heterogeneous Data (Hardcover) $145.59
Data-Driven and Model-Based Methods for Fault Detection and Diagnosis (Paperback) $175.59
Lecture Notes in Networks and Systems Data Analytics in System Engineering: Proceedings of 7th Computational Methods in Systems and Software 2023, Vol. 4, Book 935 (Paperback) $169.65 (was $199.99)
Intelligent Data-Centric Systems Diagnostic Biomedical Signal and Image Processing Applications with Deep Learning Methods (Paperback) $175.59
Advances in Database Systems Replication Techniques in Distributed Systems, Book 4 (Paperback) $199.85
Studies in Fuzziness and Soft Computing Real Life Applications of Multiple Criteria Decision Making Techniques in Fuzzy Domain, Book 420 (Hardcover) $183.51
