|
|
Publications and selected presentations by
Peter Christen
Go to presentations
Publications:
2013:
- A Taxonomy of Privacy-Preserving Record Linkage Techniques
Dinusha Vatsalan, Peter Christen, and Vassilios Verykios.
In
Information Systems (Elsevier), volume 38, issue 6,
September 2013, Pages 946-969.
Article available online at
http://dx.doi.org/10.1016/j.is.2012.11.005.
- Adaptive Temporal Entity Resolution on Dynamic Databases
Peter Christen and Ross Gayler.
Proceedings of the
Seventeenth
Pacific-Asia Conference on Knowledge Discovery and Data
Mining (PAKDD'13), Gold Coast, Australia, April 2013.
Article available online from
Springer Link.
- Sorted Nearest Neighborhood Clustering for Efficient Private
Blocking
Dinusha Vatsalan and Peter Christen.
Proceedings of the
Seventeenth
Pacific-Asia Conference on Knowledge Discovery and Data
Mining (PAKDD'13), Gold Coast, Australia, April 2013.
Article available online from
Springer Link.
- Cross Language Prediction of Vandalism on Wikipedia using
Article Views and Revisions
Khoi-Nguyen Tran and Peter Christen.
Proceedings of the
Seventeenth
Pacific-Asia Conference on Knowledge Discovery and Data
Mining (PAKDD'13), Gold Coast, Australia, April 2013.
Article available online from
Springer Link.
- Dynamic Similarity-Aware Inverted Indexing for Real-Time Entity
Resolution
Banda Ramadan, Peter Christen, Huizhi Liang, Ross Gayler, and David
Hawking.
In proceedings of the
International Workshop
on Data Mining Applications in Industry and Government
(DMApps 2013),
held at the Seventeenth Pacific-Asia Conference on Knowledge Discovery and Data
Mining (PAKDD'13), Gold Coast, Australia, April 2013.
Paper
(pdf, 272 KB)
- Predicting High Impact Academic Papers using Citation Network
Features
Daniel McNamara, Paul Wong, Peter Christen and Kee Siong Ng.
In proceedings of the
International Workshop
on Data Mining Applications in Industry and Government
(DMApps 2013),
held at the Seventeenth Pacific-Asia Conference on Knowledge Discovery and Data
Mining (PAKDD'13), Gold Coast, Australia, April 2013.
Paper
(pdf, 328 KB)
2012:
- A Bag Reconstruction Method for Multiple
Instance Classification and Group Record Linkage
Zhichun Fu, Jun Zhou, Furong Peng, and Peter Christen.
Proceedings of the
Eighth International
Conference on Advanced Data Mining and Applications
(ADMA'12), Nanjing, China, December 2012.
Article available online from
Springer Link.
- An Iterative Two-Party Protocol for Scalable
Privacy-Preserving Record Linkage
Dinusha Vatsalan and Peter Christen.
Proceedings of the Tenth Australasian Data Mining Conference
(AusDM'12), Sydney, December 2012.
Paper,
(pdf, 279 KB)
- CA4IOT: Context Awareness for Internet of Things
Charith Perera, Arkady Zaslavsky, Peter Christen, and Dimitrios
Georgakopoulos,
Proceedings of the
IEEE International
Conference on Green Computing and Communications, Conference
on Internet of Things, and Conference on Cyber, Physical and
Social Computing (GreenCom/iThings/CPSCom'12), Besancon,
France, November 2012.
- Time-aware Topic Recommendation Based on Micro-blogs
Huizhi Liang, Yue Xu, Dian Tjondronegoro, and Peter Christen.
Proceedings of the
ACM Conference on Information
and Knowledge Management (CIKM'12), Hawaii, October 2012
- Capturing Sensor Data from Mobile Phones using Global
Sensor Network Middleware
Charith Perera, Arkady Zaslavsky, Peter Christen, Ali Salehi,
and Dimitrios Georgakopoulos.
Proceedings of the
IEEE International
Workshop on Internet-of-Things Communications and Networking
(IoT-CN),
held at the 23rd IEEE International Symposium on
Personal, Indoor and Mobile Radio Communications (PIMRC),
Sydney, September 2012.
- A Survey of Indexing Techniques for Scalable Record
Linkage and Deduplication
Peter Christen
In
IEEE
Transactions on Knowledge and Data Engineering (TKDE),
vol. 12, no. 9, September 2012.
Article available online from
Computer.org digital library.
- Data Matching -
Concepts and Techniques for Record Linkage, Entity Resolution,
and Duplicate Detection
Peter Christen.
Springer, Data-Centric Systems and Applications, August 2012.
Preface, table of
contents, and references are available for download.
- Event Diffusion Patterns in Social Media
Minkyoung Kim, Lexing Xie, and Peter Christen.
International
AAAI Conference on Weblogs and Social Media,
Dublin, June 2012.
Paper (pdf, 1.3 MB)
- Multiple Instance Learning for Group Record Linkage
Zhichun Fu, Jun Zhou, Peter Christen and Mac Boot.
Proceedings of the
Sixteenth
Pacific-Asia Conference on Knowledge Discovery and Data
Mining (PAKDD'12), Kuala Lumpur, May-June 2012.
Article available online from
Springer Link.
- Connecting Mobile Things to Global Sensor Network
Middleware using System-generated Wrappers
Charith Perera, Arkady Zaslavsky, Peter Christen, Ali Salehi,
and Dimitrios Georgakopoulos.
Proceedings of the ACM International Workshop on Data
Engineering for Wireless and Mobile Access (MobiDE),
ACM Special Interest Group on Management of Data and
Principles of Database Systems (SIGMOD/PODS),
Scottsdale, Arizona, May 2012.
Paper available online from
ACM Digital Library.
- New Objective Functions for Social Collaborative Filtering.
Joseph Noel, Scott Sanner, Khoi-Nguyen Tran, Peter Christen,
Lexing Xie, Edwin Bonilla, Ehsan Abbasnejad, and
Nicolas Della Penna.
World Wide Web
conference (WWW'12), Lyon, April 2012.
Paper available online from
WWW'12 Proceedings.
2011:
- Automatic Cleaning and Linking of Historical Census Data
using Household Information
Zhichun Fu, Peter Christen and Mac Boot.
Proceedings of the
Fifth
International Workshop on Domain Driven Data Mining
(DDDM'11), held at
IEEE ICDM,
Vancouver, December 2011.
- Proceedings of the Ninth Australasian Data Mining
Conference (AusDM'11)
Peter Vamplew, Andrew Stranieri, Kok-Leong Ong, Peter Christen
and Paul Kennedy (editors).
Proceedings of the
Ninth Australasian Data Mining Conference,
Ballarat, December 2011.
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 121.
- An Efficient Two-Party Protocol for Approximate Matching
in Private Record Linkage
Dinusha Vatsalan, Peter Christen and Vassilios Verykios.
Proceedings of the Ninth Australasian Data Mining Conference
(AusDM'11), Ballarat, December 2011.
Paper (pdf, 880 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 121.
- A Supervised Learning and Group Linking Method
for Historical Census Household Linkage
Zhichun Fu, Peter Christen and Mac Boot.
Proceedings of the Ninth Australasian Data Mining Conference
(AusDM'11), Ballarat, December 2011.
Paper (pdf, 860 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 121.
- Fake Injection Strategies for Private Phonetic Matching
Alexandros Karakasidis, Vassilios Verykios and Peter Christen.
Proceedings of the
International Workshop on
Data Privacy Management (DPM2011), Leuven, Belgium, September 2011.
- Analysis of Cluster Migrations using Self-Organizing Maps
Denny, Peter Christen and Graham Williams.
Proceedings of the
International Workshop on Behavior Informatics (BI2011), 15th
Pacific-Asia Conference on Knowledge Discovery and Data Mining
(PAKDD2011), Shenzhen, China, May 2011.
- Robust Record Linkage Blocking using Suffix Arrays and Bloom
Filters
Timothy de Vries, Hui Ke, Sanjay Chawla and Peter Christen.
In ACM Transactions
on Knowledge Discovery from Data, vol. 2, no. 5, February 2011.
Available online.
2010:
- Visualizing Temporal Cluster Changes using Relative Density
Self-Organizing Maps
Denny, Graham Williams and Peter Christen
In Knowledge and Information Systems Springer, vol. 25, no. 2,
November 2010.
Paper
available online.
-
New Frontiers in Applied Data Mining
T. Theeramunkong, C. Nattee, P.J.L. Adeodato, N. Chawla; Peter Christen,
P. Lenca, J. and G Williams (editors).
Revised Selected Papers from the Pacific-Asia Conference on Knowledge
Discovery and Data Mining (PAKDD) workshops, Bangkok, Thailand,
April 2009.
2009:
- Data Mining and Analytics 2009
Paul Kennedy, Kok-Leong Ong and Peter Christen (editors).
Proceedings of the
Seventh Australasian Data Mining Conference
(AusDM 2009), Melbourne, December 2009.
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 101.
- Robust Record Linkage Blocking using Suffix Arrays
Timothy de Vries, Hui Ke, Sanjay Chawla and Peter Christen.
Proceedings of the
ACM Conference on Information and Knowledge
Management (CIKM), Hong Kong, November 2009.
Paper
available online.
- Similarity-Aware Indexing for Real-Time Entity
Resolution
Peter Christen, Ross Gayler and David Hawking.
Proceedings of the
ACM Conference on Information and Knowledge
Management (CIKM), Hong Kong, November 2009.
Paper
available online.
The full paper (10 pages) is published as an
ANU Computer Science
technical report.
Report (pdf, 273 KB)
Report (ps.gz, 285 KB)
- Development and User Experiences of an Open Source
Data Cleaning, Deduplication and Record Linkage System
Peter Christen
In SIGKDD Explorations, Volume 11, Issue 1,
July 2009.
Available online:
Paper (pdf, 778 KB)
- Accurate Synthetic Generation of Realistic Personal
Information
Peter Christen and Agus Pudjijono
Proceedings of the
Pacific-Asia Conference on Knowledge Discovery
and Data Mining (PAKDD), Bangkok, Thailand, April 2009.
Paper available online.
Submitted paper
(12 pages, pdf, 645 KB)
- Geocode Matching and Privacy Preservation
Peter Christen
Invited Presentation at the
PinKDD 2008 workshop held at the
ACM SIGKDD 2008 conference, Las Vegas,
August 2008.
In Revised, Selected Papers, F. Bonchi, E. Ferrari,
W. Jiang and B. Malin (editors).
Springer Lecture Notes in Computer Science (LNCS), vol. 5456, 2009.
Paper available online.
2008:
- Visualization of Temporal Changes in Cluster Structures
using Self-Organizing Maps
Denny, Graham Williams, and Peter Christen
Accepted as regular paper for the
IEEE
International Conference on Data Mining (ICDM), Pisa, Italy,
December 2008.
Please contact
Denny if you are interested in this paper.
- Data Mining and Analytics 2008
John Roddick, Jiuyong Li, Peter Christen and Paul Kennedy
(editors).
Proceedings of the
Seventh Australasian Data Mining Conference
(AusDM 2008), Glenelg, Adelaide, November
2008.
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 87.
- Towards Scalable Real-Time Entity Resolution using a
Similarity-Aware Inverted Index Approach
Peter Christen and Ross Gayler
In proceedings of the Seventh Australasian Data Mining
Conference (AusDM 2008), Glenelg, Adelaide, November
2008.
Paper
(pdf, 218 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 87.
- Automatic Record Linkage using Seeded Nearest Neighbour
and Support Vector Machine Classification
Peter Christen
Proceedings of the ACM SIGKDD 2008 conference, Las Vegas,
August 2008.
Paper available online.
- Febrl - An Open Source Data Cleaning, Deduplication and
Record Linkage System with a Graphical User Interface
Peter Christen
Proceedings of the
demo session
at the ACM SIGKDD 2008
conference, Las Vegas, August 2008.
Paper available online.
- Automatic Training Example Selection for Scalable
Unsupervised Record Linkage
Peter Christen
Proceedings of the
Pacific-Asia Conference on Knowledge Discovery
and Data Mining (PAKDD), Osaka, Japan, May 2008.
Paper available online.
Submitted paper
(12 pages, pdf, 146 KB)
Submitted
paper (12 pages, ps.gz, 142 KB)
- Exploratory Hot Spot Profile Analysis using Interactive
Visual Drill-Down Self-Organizing Maps
Denny, Graham J. Williams and Peter Christen.
Proceedings of the
Pacific-Asia Conference on Knowledge Discovery
and Data Mining (PAKDD), Osaka, Japan, May 2008.
Paper available online.
- Febrl - A Freely Available Record Linkage System
with a Graphical User Interface
Peter Christen
Proceedings of the
Australasian Workshop on Health Data and
Knowledge Management (HDKM), Wollongong, January 2008.
Paper
(pdf, 748 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 80.
2007:
- Data Mining and Analytics 2007
Peter Christen, Paul J. Kennedy, Jiuyong Li, Inna Kolyshkina
and Graham J. Williams (editors).
Proceedings of the
Sixth Australasian Data Mining Conference
(AusDM 2007), Gold Coast, Australia, December 2007.
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 70.
- A Two-Step Classification Approach to Unsupervised Record
Linkage
Peter Christen
In proceedings of the Sixth Australasian Data Mining Conference
(AusDM 2007), Gold Coast, December 2007.
Paper
(pdf, 440 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 70.
- Exploratory Multilevel Hot Spot Analysis: Australian
Taxation Office Case Study
Denny, Graham J. Williams, and Peter Christen
In proceedings of the Sixth Australasian Data Mining Conference
(AusDM 2007), Gold Coast, December 2007.
Paper
(pdf, 759 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 70.
- Evaluation of a Graduate Level Data Mining Course
with Industry Participants
Peter Christen
In proceedings of the Sixth Australasian Data Mining Conference
(AusDM 2007), Gold Coast, December 2007.
Paper
(pdf, 436 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 70.
- Towards parameter-free blocking for scalable record
linkage
Peter Christen
Technical Report TR-CS-07-03
ANU Joint Computer Science Technical Report
Series, August 2007.
Report
(pdf, 201 KB)
Report
(ps.gz, 199 KB)
- Quality and Complexity Measures for Data Linkage and
Deduplication
Peter Christen and Karl Goiser
Chapter in the book
Quality
Measures in Data Mining, vol. 43,
Studies in Computational Intelligence.
F. Guillet and H. Hamilton (eds), Springer, March 2007.
Available online at
SpringerLink.
2006:
- Privacy-Preserving Data Linkage and Geocoding: Current
Approaches and Research Directions
Peter Christen
In proceedings of the Workshop on Privacy Aspects of Data Mining (PADM)
held at the IEEE International Conference on Data
Mining (ICDM), Hong Kong, December 2006.
Final 5-page version:
Paper
(pdf, 53 KB)
Paper
(ps.gz, 35 KB)
Submitted 11-page version:
Paper
(pdf, 118 KB)
Paper
(ps.gz, 74 KB)
- A Comparison of Personal Name Matching: Techniques and
Practical Issues
Peter Christen
In proceedings of the Workshop on Mining Complex Data (MCD)
held at the IEEE International Conference on Data
Mining (ICDM), Hong Kong, December 2006.
Final 5-page version:
Paper
(pdf, 57 KB)
Paper
(ps.gz, 40 KB)
Submitted 12-page version available as:
Technical Report TR-CS-06-02
ANU Joint Computer Science Technical Report
Series, September 2006.
Report
(pdf, 248 KB)
Report
(ps.gz, 236 KB)
- Dynamic Algorithm Selection Using Reinforcement Learning
Warren Armstrong, Peter Christen, Eric McCreath and Alistair
Rendell
Proceedings of the
Workshop on Integrating AI and Data Mining,
Hobart, Australia, December 2006.
Paper
(pdf, 254 KB)
- Data Mining and Analytics 2006
Peter Christen, Paul J. Kennedy, Jiuyong Li, Simeon J. Simoff
and Graham J. Williams (editors).
Proceedings of the Fifth Australasian Data Mining Conference
(AusDM 2006), Sydney, November, 2006.
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 61.
- Towards Automated Record Linkage
Karl Goiser and Peter Christen
In proceedings of the Fifth Australasian Data Mining Conference
(AusDM 2006), Sydney, November 2006.
Paper
(pdf, 513 KB) available online from
Conferences in Research and Practice in
Information Technology (CRPIT), vol. 61.
- Secure Health Data Linkage and Geocoding: Current
Approaches and Research Directions
Peter Christen and Tim Churches
Proceedings of the
National e-Health Privacy and Security
Symposium (ehPASS), Brisbane, October 2006.
Paper
(pdf, 139 KB)
Paper
(ps.gz, 127 KB)
- Automated Geocoding of Routinely Collected Health Data
in New South Wales
Richard Summerhayes, Paul Holder, John Beard, Peter
Christen, Alan Willmore and Tim Churches
The NSW Public Health Bulletin,
volume 17, number 3-4, March-April 2006.
Online version available
here.
- A Probabilistic Geocoding System Utilising a Parcel Based
Address File
Peter Christen, Alan Willmore and Tim Churches
In Advances in Data Mining: Theory, Methodology,
Techniques, and Applications. Simeon Simoff and Graham
Williams (editors). State-of-the-Art Lecture Notes in
Artificial Intelligence, Volume 3755, Springer-Verlag,
2006.
Available online at
SpingerLink, LNCS 3755.
Copyright for this publication is held by the Springer
Verlag.
2005:
- Automated Probabilistic Address Standardisation and
Verification
Peter Christen and Daniel Belacic
Proceedings of the
fourth Australasian
Data Mining Conference (AusDM 2005), Sydney, December 2005.
Paper
(pdf, 146 KB)
Paper
(ps.gz, 204 KB)
- Assessing Deduplication and Data Linkage Quality: What to
Measure?
Peter Christen and Karl Goiser
Proceedings of the
fourth Australasian
Data Mining Conference (AusDM 2005), Sydney, December 2005.
Paper
(pdf, 178 KB)
Paper
(ps.gz, 163 KB)
- Probabilistic Data Generation for Deduplication and
Data Linkage
Peter Christen
Proceedings of the
Sixth
International Conference on Intelligent Data Engineering
and Automated Learning (IDEAL'05), Brisbane, July 2005.
Copyright for this publication is held by the Springer Verlag.
Available online at
SpingerLink,
LNCS 3578.
Paper
(pdf, 124 KB)
Paper
(ps.gz, 135 KB)
- Febrl - Freely extensible biomedical record linkage
(Manual, release 0.3)
Peter Christen and Tim Churches
Available online from
SourceForge.Net, April
2005.
Manual
(pdf, 960 KB)
Manual
(pdf, 282 KB)
- A Probabilistic Deduplication, Record Linkage
and Geocoding System
Peter Christen and Tim Churches
Proceedings of the
ARC Health Data Mining workshop,
University of South Australia, April 2005.
Paper
(pdf, 136 KB)
Paper
(ps.gz, 134 KB)
2004:
- A Probabilistic Geocoding System based on
a National Address File
Peter Christen, Tim Churches and Alan Willmore
Proceedings of the
Australasian Data Mining Conference,
Cairns, December 2004.
Paper
(pdf, 120 KB)
Paper
(ps.gz, 128 KB)
- Some Methods for Blindfolded Record Linkage
Tim Churches and Peter Christen
Published online at
BioMed Central
Medical Informatics and Decision Making,
June 2004.
For abstract and downloadable PDF file see
here.
- Febrl - A Parallel Open Source Data Linkage System
Peter Christen, Tim Churches and Markus Hegland
Proceedings of the 8th
PAKDD'04
(Pacific-Asia Conference on Knowledge Discovery and Data
Mining), Sydney, May 2004.
Springer Lecture Notes in Artificial Intelligence,
(3056), available online at
Springerlink.
Copyright for this publication is held by the Springer Verlag.
Paper
(pdf, 202 KB)
Paper
(ps.gz, 81 KB)
- Blind Data Linkage using n-gram Similarity
Comparisons
Tim Churches and Peter Christen
Proceedings of the 8th
PAKDD'04
(Pacific-Asia Conference on Knowledge Discovery and Data
Mining), Sydney, May 2004.
Springer Lecture Notes in Artificial Intelligence,
(3056), available online
here.
Copyright for this publication is held by the Springer Verlag.
Paper
(pdf, 176 KB)
Paper
(ps.gz, 68 KB)
2003:
2002:
- Preparation of name and address data for record linkage
using hidden Markov models
Tim Churches, Peter Christen, Kim Lim and Justin X Zhu
Published online at BioMed Central
Medical Informatics and Decision Making,
December 2002.
For abstract and downloadable PDF file see
here.
Also available locally:
Paper
(pdf, 353 KB)
- Probabilistic Name and Address Cleaning and
Standardisation
Peter Christen, Tim Churches and Justin Xi Zhu
Proceedings of the
Australasian
Data Mining Workshop, Canberra, December 2002.
Paper
(ps.gz, 74 KB)
Paper
(pdf, 158 KB)
- How Fast is '-fast'? Performance Analysis of KDD
Applications using Hardware Performance Counters on
UltraSPARC-III
Adam Czezowski and Peter Christen
Proceedings of the
Australasian
Data Mining Workshop, Canberra, December 2002.
Paper
(ps.gz, 82 KB)
Paper
(pdf, 174 KB)
- High-Performance Computing Techniques for
Record Linkage
Peter Christen, Justin Xi Zhu, Markus Hegland, Stephen Roberts,
Ole M. Nielsen, Tim Churches and Kim Lim
Proceedings of the Australian Health Outcomes
Conference (AHOC-2002), Canberra, July 2002.
Paper
(ps.gz, 95 KB)
Paper
(pdf, 233 KB)
- Parallel Computing Techniques for
High-Performance Probabilistic Record Linkage
Peter Christen, Markus Hegland, Stephen Roberts,
Ole M. Nielsen, Tim Churches and Kim Lim
Proceedings of the Symposium on Health Data
Linkage, Sydney, March 2002.
Paper
(ps.gz, 107 KB)
Paper
(pdf, 228 KB)
- Performance Analysis of KDD Applications using
Hardware Event Counters
Peter Christen and Adam Czezowski
Technical Report TR-CS-02-01, ANU Joint Computer
Science Technical Report Series, February 2002.
Report
(ps.gz, 131 KB)
Report
(pdf, 238 KB)
2001:
- DMtools - Open Source Software for Database Mining
Peter Christen, Ole M. Nielsen and Markus Hegland
Accepted by the
Workshop on Database Support for KDD (at the
PKDD'2001
Conference), Freiburg, Germany, September 2001.
Paper (ps.gz, 81 KB)
- Parallel Data Mining on a Beowulf Cluster
Peter Christen, Ole M. Nielsen, Markus Hegland and
Peter E. Strazdins
Proceedings of the HPC Asia 2001
Conference, Gold Coast, Queensland, Australia,
September 2001.
Paper (ps.gz, 264 KB)
Paper (pdf.gz, 311 KB)
- A Scalable Parallel FEM Surface Fitting Algorithm for Data
Mining
Peter Christen, Markus Hegland, Stephen Roberts, Ole M.
Nielsen and Irfan Altas
Proceedings of the International Workshop on Mining Spatial
and Temporal Data
(at the
PAKDD-2001
Conference), Hong Kong, April 2001.
Paper (ps.gz, 229 KB)
- A Toolbox Approach to Flexible and Efficient
Data Mining
Ole M. Nielsen, Peter Christen, Markus Hegland,
Tatiana Semenova and Timothy Hancock
Proceedings of the
PAKDD-2001
Conference, Hong Kong, April 2001.
Published in the
Springer
Lecture Notes in Computer Science, Artificial
Intelligence series, LNAI2035.
Copyright for this publication is held by the Springer
Verlag.
Paper (ps.gz, 143 KB)
Paper (pdf, 183 KB)
- Towards a Parallel Data Mining Toolbox
Peter Christen, Markus Hegland, Ole M. Nielsen, Stephen
Roberts, Peter E. Strazdins, Irfan Altas, Tatiana Semenova and
Timothy Hancock
Proceedings of the 15th International Parallel and Distributed
Processing Symposium (IPDPS-2001), San Francisco,
April 2001.
Workshop Parallel
and Distributed Data Mining.
Copyright 2001 Institute of Electrical and Electronic
Engineers (IEEE). Reprinted for the Proceedings of the
IPDPS-2001.
Paper (ps.gz, 139 KB)
- Data Mining with Python
Ole M. Nielsen, Peter Christen, Markus Hegland and Tatiana
Semenova
Proceedings of the
9th International Python
Conference, Long Beach, California, March 2001.
Paper available upon request from:
Ole Nielsen.
- A Scalable Parallel FEM Surface Fitting Algorithm for Data
Mining
Peter Christen, Markus Hegland, Stephen Roberts and Irfan Altas
Technical Report TR-CS-01-01, ANU Joint Computer Science
Technical Report Series, January 2001.
Report (ps.gz, 255 KB)
Report (pdf, 377 KB)
2000:
- Scalable Parallel Algorithms for Surface Fitting and Data
Mining
Peter Christen, Markus Hegland, Ole M. Nielsen, Stephen
Roberts, Peter E. Strazdins and Irfan Altas
Accepted for publication by the Elsevier Journal of
Parallel Computing,
special issue on Aspects of Parallel Computing for Linear
Systems and Associated Problems, September 2000.
- Data Mining of Administrative Claims Data of Pathology
Services
Simon Hawkins, Graham Williams, Rohan Baxter, Peter Christen,
Michael Fett, Markus Hegland, Fuchun Huang, Ole Nielsen, Tatiana
Semenova and Andrew Smith
Accepted by the Thirty-Fourth Hawaii International Conference on
System Sciences (HICSS-34), January 2001.
Available upon request from:
Rohan Baxter,
CSIRO CMIS.
- Scalable Parallel Algorithms for Predictive Modelling
Peter Christen, Markus Hegland, Ole Møller Nielsen,
Stephen Roberts and Irfan Altas
Proceedings of the Data Mining 2000 Conference, Cambridge, UK,
N. Ebecken and C.A. Brebbia, editors, in Data Mining II,
WIT Press, Southhampton Boston, 2000.
Paper
(ps.gz, 606 KB)
1999:
- The Integrated Delivery of Large-Scale Data Mining:
The ACSYS Data Mining Project
Graham Williams, Irfan Altas, Sergey Barkin, Peter Christen,
Markus Hegland, Alonso Marquez, Peter Milne, Rajehndra Nagappan
and Stephen Roberts
KDD-99 Workshop on Large-Scale Parallel KDD Systems. San Diego,
August 1999,
Springer Lecture Notes in Artificial Intelligence 1759.
- Parallelization of a Finite Element Surface Fitting Algorithm
for Data Mining
Peter Christen, Irfan Altas, Markus Hegland, Stephen Roberts,
Kevin Burrage and Roger Sidje.
Proceedings of the CTAC-99 Conference. Canberra, 20-24 September
1999.
Paper
(ps.gz, 552 KB)
Slides
(ps.gz, 614 KB)
- A Parallel Iterative Linear System Solver
with Dynamic Load Balancing
Peter Christen
Proceedings of the CTAC-99 Conference. Canberra, 20-24 September
1999.
Paper
(ps.gz, 467 KB)
Slides (ps.gz, 218 KB)
- A Parallel Finite Element Surface Fitting Algorithm for
Data Mining
Peter Christen, Irfan Altas, Markus Hegland, Stephen Roberts,
Kevin Burrage and Roger Sidje
Proceedings of the ParCo-99 Conference, Delft,
17-20 August 1999.
- A Parallel Iterative Linear System Solver with
Dynamic Load Balancing
Peter Christen
Dissertation (PhD thesis), Institut für Informatik,
Universität Basel. February 1999.
Available upon
request.
1998:
- PAISS - Design and Implementation of a Parallel
Iterative Linear System Solver with Dynamic Load
Balancing
Peter Christen
Technischer Bericht 98-5, October 1998.
Report (ps.gz, 193 KB)
- A Parallel Iterative Linear System Solver with
Dynamic Load Balancing
Peter Christen
Proceedings of the ACM International Conference of
Supercomputing (ICS) 1998. Melbourne, 13-17 July 1998.
- Dynamic Load Balancing within a Parallel Iterative
Linear System Solver
Peter Christen.
Proceedings of the High-Performance Computing and Networking
(HPCN) Conference 1998. Amsterdam, 21-23 April 1998,
Springer Lecture Notes in Computer Science 1401.
1996:
- Speicher-Schemata für spärlich besetzte
Matrizen (German)
Peter Christen
Institut für Informatik, Universität Basel.
Technischer Bericht 96-4, September 1996.
Report (ps.gz, 203 KB)
1995:
- Test- und Diagnosesoftware für Alpha7
(German)
Peter Christen
Diplomarbeit (MS thesis), Institut für Elektronik,
ETH Zürich. Prof.Dr. A. Gunzinger, July 1995.
Selected Presentations:
2012:
2011:
- Privacy-Preserving Data Matching
Peter Christen
Invited presentation to the Data Matching Working Group
Australian Government Attorney-General's Department,
Canberra, July 2011.
Slides
8up (pdf, 2.0 MB)
- Scalable Privacy-Preserving Record Linkage using
Similarity-Based Indexing
Invited presentation at
Fujitsu Laboratories, Kawasaki, Japan, June 2011.
Slides available upon
request.
2010:
2009:
- Privacy-preserving Data Sharing and Matching
Peter Christen
Departmental seminar at the
ANU Computer
Sciences Lab, Canberra, May 2009.
Slides
4up (pdf, 2.1 MB)
Slides
1up (pdf, 634 KB)
- Accurate Synthetic Generation of Realistic Personal
Information
Peter Christen and Agus Pudjijono
Presentation at the
Pacific-Asia Conference on Knowledge Discovery
and Data Mining (PAKDD 2009), Bangkok, Thailand, April 2009.
Slides
6up (pdf, 1.5 MB)
Slides
1up (pdf, 792 KB)
- Data Linkage - An Overview and Research at the ANU
Peter Christen
Invited Presentation at the
ANU Supercomputer Facility, Canberra, March 2009.
Slides
6up (pdf, 2.2 MB)
Slides
1up (pdf, 721 KB)
2008:
- Towards Scalable Real-Time Entity Resolution using a
Similarity-Aware Inverted Index Approach
Peter Christen and Ross Gayler
Presentation at the
Seventh Australasian Data Mining Conference
(AusDM 2008), Glenelg, Adelaide, November 2008.
Slides
4up (pdf, 1.4 MB)
Slides
1up (pdf, 491 KB)
- Privacy-Preserving Data Linkage
Peter Christen
Part of the tutorial on Privacy preserving data sharing
and mining held at the Seventh Australasian Data Mining Conference
(AusDM 2008), Glenelg, Adelaide, November 2008.
Slides
4up (pdf, 3.2 MB)
Slides
1up (pdf, 872 KB)
- Data Matching of Bibliographic Data: Recent Advances
and an Open Source Matching System
Peter Christen
Presentation at the
2008 Annual Forum of the
Australasian
Association for Institutional Research (AAIR), Canberra,
November 2008.
Slides
6up (pdf, 1.9 MB)
Slides
1up (pdf, 839 KB)
- Automatic Record Linkage using Seeded Nearest Neighbour
and Support Vector Machine Classification
Peter Christen
Presentation at the ACM SIGKDD 2008 conference, Las Vegas,
August 2008.
Slides
6up (pdf, 813 KB)
Slides
1up (pdf, 293 KB)
- Geocode Matching and Privacy Preservation
Invited Presentation at the
PinKDD 2008 workshop held at the
ACM SIGKDD 2008 conference, Las Vegas,
August 2008.
Slides
9up (pdf, 1.5 MB)
Slides
1up (pdf, 513 KB)
- Automatic Training Example Selection for Scalable
Unsupervised Record Linkage
Peter Christen
Presentation at the
Pacific-Asia Conference on Knowledge Discovery
and Data Mining (PAKDD 2008), Osaka, Japan, May 2008.
Slides
4up (pdf, 1.2 MB)
Slides
1up (pdf, 391 KB)
2007:
- A Two-Step Classification Approach to Unsupervised Record
Linkage
Peter Christen
Presentation at the
Sixth Australasian Data Mining Conference
(AusDM 2007), Gold Coast, Australia, December 2007.
Slides
4up (pdf, 1.3 MB)
- Evaluation of a Graduate Level Data Mining Course
with Industry Participants
Peter Christen
Presentation at the
Sixth Australasian Data Mining Conference
(AusDM 2007), Gold Coast, Australia, December 2007.
Slides
4up (pdf, 1.2 MB)
- Data Linkage Research at the ANU
Peter Christen
Invited talk at the The Distillery, Canberra, July 2007.
Slides
8up (pdf.gz, 1.7 MB)
Slides
8up (ps.gz, 1.4 MB)
2006:
- Privacy-Preserving Data Linkage and Geocoding: Current
Approaches and Research Directions
Peter Christen
Presentation at the Workshop on Privacy Aspects of Data Mining (PADM)
held at the IEEE International Conference on Data
Mining (ICDM), Hong Kong, December 2006.
Also presented as a
Departmental
seminar at the ANU
Department of Computer Science, Canberra, December 2006.
Slides
(pdf, 859 KB)
Slides
8up (ps.gz, 1.4 MB)
- A Comparison of Personal Name Matching: Techniques and
Practical Issues
Peter Christen
Presentation at the Workshop on Mining Complex Data (MCD)
held at the IEEE International Conference on Data
Mining (ICDM), Hong Kong, December 2006.
Also presented as a
Departmental
seminar at the ANU
Department of Computer Science, Canberra, December 2006.
Slides
(pdf, 830 KB)
Slides
8up (ps.gz, 1.3 MB)
- Recent Developments in Data Linkage and Research at the
ANU
Peter Christen
Invited talk at the
Australian
Taxation Office, data matching personnel, Canberra,
December 2006.
Slides
10up (ps.gz, 1.6 MB)
- Data Quality Aspects in Data Mining, Data Linkage and
Geocoding
Peter Christen
Invited talk at
Geoscience
Australia, Canberra, November 2006.
Slides
(pdf, 1.7 MB)
Slides
(ps.gz, 1.2 MB)
Slides
9up (ps.gz, 1.2 MB)
- Secure Health Data Linkage and Geocoding: Current
Approaches and Research Directions
Peter Christen and Tim Churches
Presentation at the
National e-Health Privacy and Security
Symposium (ehPASS), Brisbane, October 2006.
Slides
(pdf, 779 KB)
Slides
9up (ps.gz, 363 KB)
- Data Linkage Techniques: Past, Present and Future
Peter Christen
Invited talk at the
Australian
Taxation Office, Canberra, October 2006.
(same set of slides as used for the Analytics Practise
Group presentation, see below).
- Data Linkage Techniques: Past, Present and Future
Peter Christen
Invited talk at the
Canberra
Analytics Practise Group, Canberra, August 2006.
Slides 8up
(pdf, 1.5 MB)
Slides 8up
(ps.gz, 630 KB)
Slides
(pdf, 1.5 MB)
2005:
- Automated Probabilistic Address Standardisation and
Verification
Peter Christen
Presentation at the
fourth Australasian
Data Mining Conference (AusDM 2005), Sydney, December 2005.
Slides
8up (ps.gz, 428 KB)
Slides
(pdf, 892 KB)
Slides
(ps.gz, 425 KB)
- Reflections on COMP2720 (Automating Tools for New Media)
Peter Christen
Departmental
seminar at the ANU
Department of Computer Science, Canberra, November 2005.
Slides
6up (ps.gz, 3.0 MB)
Slides
6up (pdf, 523 KB)
- Recent Developments in Data Linkage Technologies
Peter Christen
Invited talk at the
Data
Linkage Symposium of the Canberra Branch of the
Statistical Society of
Australia, Canberra, September 2005.
Slides
(pdf, 1.7 MB)
Slides 8up
(ps.gz, 690 KB)
- Probabilistic Data Generation for Deduplication and
Data Linkage
Peter Christen
Presentation at the
Sixth
International Conference on Intelligent Data Engineering
and Automated Learning (IDEAL'05), Brisbane, July 2005.
Slides
(pdf, 704 KB)
Slides
(ps.gz, 269 KB)  
Slides 8up
(pdf, 677 KB)
Slides 8up
(ps.gz, 272 KB)
- Probabilistic Deduplication, Data Linkage and Geocoding
Peter Christen
Presentation at the
DAMA Canberra Chapter,
June 2005.
Slides 8up
(pdf, 2.7 MB)
Slides 8up
(ps.gz, 1.3 MB)
- Probabilistic Deduplication, Record Linkage
and Geocoding
Peter Christen
Guest lecture for
MATH1500: ANU Computational Science Undergraduate Seminar,
ANU, May 2005.
Slides 8up
(pdf, 2.1 MB)
Slides 8up
(ps.gz, 989 KB)
- A very short Introduction to Data Mining
Peter Christen
Guest lecture for
COMP3420:
Database Systems, ANU, May 2005.
Slides 8up
(pdf, 74 KB)
Slides 8up
(ps, 136 KB)
Slides 1up
(pdf, 88 KB)
- Febrl - A parallel open source record linkage and geocoding
system
Peter Christen
Presentation at the Data Linkage Workshop,
Australian Bureau of Statistics,
Canberra, April 2005.
Slides 8up
(pdf, 2.4 MB)
Slides 8up
(ps.gz, 1.2 MB)
- A Probabilistic Deduplication, Record Linkage
and Geocoding System
Peter Christen and Tim Churches
Presentation at the
ARC Health Data Mining workshop, University of South
Australia, April 2005.
Slides 4up
(pdf, 854 KB)
Slides 4up
(ps.gz, 389 KB)
Slides
(pdf, 885 KB)
Slides
(ps.gz, 386 KB)
2004:
- A Probabilistic Geocoding System based on
a National Address File
Peter Christen, Tim Churches and Alan Willmore
Presentation at the
Australasian Data Mining Conference,
Cairns, December 2004.
Slides 4up
(pdf, 1.6 MB)
Slides 4up
(ps.gz, 752 KB)
Slides
(pdf, 1.6 MB)
Slides
(ps.gz, 751 KB)
- Febrl - A parallel open source data linkage and geocoding
system
Peter Christen
Presentation at the Open Source Workshop,
Australian Bureau of Statistics,
Canberra, July 2004.
Slides 4up
(pdf, 1.3 MB)
Slides 4up
(ps.gz, 595 KB)
- Febrl - A parallel open source data linkage system
Peter Christen
Presentation at the
PAKDD 2004,
Sydney, May 2004.
Slides
(pdf, 655 KB)
Slides 4up
(pdf, 637 KB)
- Blind data linkage using n-gram similarity comparisons
Peter Christen
Short presentation at the
PAKDD 2004,
Sydney, May 2004.
Slides
(pdf, 510 KB)
Slides 4up
(pdf, 500 KB)
2003:
2002:
- Probabilistic Name and Address Cleaning and
Standardisation
Peter Christen, Tim Churches and Justin Xi Zhu
Presentation at the
Australasian
Data Mining Workshop, Canberra, December 2002.
Slides
4up (ps.gz, 345 KB)
Slides
4up (pdf, 764k KB)
- How Fast is '-fast'? Performance Analysis of KDD
Applications using Hardware Performance Counters on
UltraSPARC-III
Adam Czezowski and Peter Christen
Presentation at the
Australasian
Data Mining Workshop, Canberra, December 2002.
Slides
4up (ps.gz, 335 KB)
Slides
4up (pdf, 745k KB)
- High-Performance Computing Techniques for
Record Linkage
Peter Christen, Tim Churches, Markus Hegland, Kim Lim, Ole M.
Nielsen, Stephen Roberts and Justin Xi Zhu
Presentation at the Australian Health Outcomes
Conference (AHOC-2002), Canberra, July 2002.
Slides 4up
(ps.gz, 1.6 MB)
Slides 4up
(pdf, 1.5 MB)
- Parallel Techniques for High-Performance
Record Linkage (Data Matching)
Peter Christen
Seminar at the ANU Department of Computer Science,
Canberra, June 2002.
Slides 4up
(ps.gz, 535 KB)
Slides 4up
(pdf, 1.1 MB)
- Parallel Computing Techniques for High-Performance
Probabilistic Record Linkage
Peter Christen, Tim Churches, Markus Hegland, Kim Lim, Ole M.
Nielsen and Stephen Roberts
Presentation at the Symposium on Health Data
Linkage, Syndey, March 2002.
Slides
(ps.gz, 1.5 MB)
Slides
(pdf, 654 KB)
- Performance Analysis of KDD Applications using Hardware
Event Counters
Peter Christen and Adam Czezowski Presentation at the
CAP Workshop 2002, Fujitsu, Kawasaki, Japan,
6 February 2002.
Slides
(ps.gz, 57 KB)
Slides
(pdf, 77 KB)
2001:
- High Performance Computing and Data Mining
Peter Christen Presentation at the AEA Data Mining
Workshop, Australasian Epidemiological Association, 10th Annual
Scientific Meeting, Sydney, 28 September, 2001.
Slides (ps.gz, 963 KB
Slides (pdf, 947 KB)
- Data Mining at the Australian National University
Peter Christen Presentation at the
Department of Computer
Science,
University of Basel,
Switzerland, September 2001.
- DMtools - Open Source Software for Database Mining
Peter Christen Presentation at the
Workshop on Database Support for KDD (at the
PKDD'2001
Conference), Freiburg, Germany, September 2001.
Slides (ps.gz, 283 KB)
Slides (pdf, 1.4 MB)
- High Performance Computing and Data Mining
Peter Christen Presentation a the EPI-SIG Health
Data Mining Seminar, Australian Museum, Sydney, 25 May,
2001.
Slides (ps.gz, 1.0 MB)
Slides (pdf, 1.1 MB)
2000:
- Application of Parallel Computing in Data Mining
Peter Christen
Seminar at the Suranaree
University of Technology, December 2000.
- Parallel Computing and Message Passing
Peter Christen.
Two-days course at the Suranaree
University of Technology, October 2000.
- Data Mining at the ANU
Peter Christen
Presentation at the ADFA/ANU Machine Learning meeting,
ANU, Canberra, September 2000.
- ACSys CRC - Data Mining Tools
Peter Christen and Ole Nielsen
Presentation and Demonstration for the ACSys CRC
Data Mining research group, ANU, Canberra, August 2000.
- Parallel Algorithms in Data Mining - The
ANU CSL Data Mining Approach
Peter Christen
Seminar at the Department of Computer Science, Australian
National University, Canberra, July 2000.
- Parallel Algorithms for Data Mining
Peter Christen
Seminar at the School for Information Studies, Charls Sturt
University, Wagga Wagga, May 2000.
Slides
(pdf, 1.7 MB)
Slides
(ps.gz, 1.5 MB)
1999:
- Parallelization of a Finite Element Surface Fitting
Algorithm for Data Mining
Peter Christen, Irfan Altas, Markus Hegland, Stephen Roberts,
Kevin Burrage and Roger Sidje
CTAC-99 Conference, Canberra, September 1999.
Slides
(ps.gz, 614 KB)
- A Parallel Iterative Linear System Solver
with Dynamic Load Balancing
Peter Christen
CTAC-99 Conference, Canberra, September 1999.
Slides (ps.gz, 218 KB)
Last modified: 29/04/2013, 07:33
|