FilterLDPSyn: Locally Differentially Private Data Synthesis Based on Measurements Filtering

Meifan Zhang; Dihang Deng; Lihua Yin

doi:10.26599/BDMA.2025.9020061

Open Access

Cyberspace Institute of Advanced Technology, Guangzhou University, Guangzhou 510006, China

Abstract

Data synthesis under Local Differential Privacy (LDP) presents a promising approach for private data analysis and sharing, as it enables the execution of all analysis tasks on raw data without the need for a trusted aggregator. The select-measure-generate paradigm of data synthesis under Differential Privacy (DP) introduces specific challenges in the context of LDP, particularly because the noise inherent to LDP is significantly greater than that of DP, especially in high-dimensional datasets. The “select” step involves calculating the correlations between attributes to identify important marginal measurements (attribute pairs), while the “measure” step aims to estimate the frequency distribution of each selected marginal under LDP. However, the utility of both the correlation and frequency estimation for multidimensional data is often unsatisfactory under LDP, as the utility of data analysis tasks typically declines with an increasing number of dimensions. To address these issues, we propose a two-stage method, named FilterLDPSyn. In Stage 1, it filters out ineffective measurements based on one-dimensional frequency and entropy estimations under LDP. In Stage 2, it enhances the utility of the distribution by iteratively collecting two-dimensional values and restoring consistency between one- and two-dimensional distributions. Experimental results demonstrate the superiority of our proposed method over existing approaches.
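The one-dimensional frequency and entropy estimation that Stage 1 relies on can be illustrated with a minimal sketch. The abstract does not say which LDP frequency oracle FilterLDPSyn uses, so this example assumes generalized randomized response (GRR), a standard LDP mechanism. The function names (grr_perturb, grr_estimate, entropy) and the clip-and-renormalize post-processing are illustrative choices, not taken from the paper, and the paper's actual filtering rule is not reproduced here.

```python
import math
import random
from collections import Counter


def grr_perturb(value, domain, epsilon, rng):
    """Client side (assumed mechanism: GRR). Keep the true value with
    probability p = e^eps / (e^eps + k - 1); otherwise report one of the
    other k - 1 values uniformly at random."""
    k = len(domain)
    if k == 1:
        return value
    p = math.exp(epsilon) / (math.exp(epsilon) + k - 1)
    if rng.random() < p:
        return value
    return rng.choice([v for v in domain if v != value])


def grr_estimate(reports, domain, epsilon):
    """Server side. Debias the observed report frequencies, then clip
    negatives to zero and renormalize so the result is a distribution.
    Requires epsilon > 0 (otherwise p == q and debiasing is undefined)."""
    k = len(domain)
    n = len(reports)
    e = math.exp(epsilon)
    p = e / (e + k - 1)
    q = 1.0 / (e + k - 1)
    counts = Counter(reports)
    raw = {v: (counts[v] / n - q) / (p - q) for v in domain}
    clipped = {v: max(f, 0.0) for v, f in raw.items()}
    total = sum(clipped.values())
    return {v: f / total for v, f in clipped.items()}


def entropy(freqs):
    """Shannon entropy in bits of an estimated frequency distribution."""
    return -sum(f * math.log2(f) for f in freqs.values() if f > 0)


if __name__ == "__main__":
    rng = random.Random(0)
    domain = ["a", "b", "c", "d"]
    # Hypothetical attribute column, for illustration only.
    data = ["a"] * 10000 + ["b"] * 6000 + ["c"] * 3000 + ["d"] * 1000
    reports = [grr_perturb(x, domain, 2.0, rng) for x in data]
    est = grr_estimate(reports, domain, 2.0)
    print(est)
    print(entropy(est))
```

Each user sends only a perturbed report, so the aggregator never sees raw values. The estimated one-dimensional distribution and its entropy are the kind of per-attribute statistics that Stage 1 is described as using to decide which measurements to keep.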

Keywords

Local Differential Privacy (LDP); data synthesis; correlation estimation; distribution estimation

Big Data Mining and Analytics

Volume 9, Issue 3, June 2026

Pages 653-671

Cite this article:

Zhang M, Deng D, Yin L. FilterLDPSyn: Locally Differentially Private Data Synthesis Based on Measurements Filtering. Big Data Mining and Analytics, 2026, 9(3): 653-671. https://doi.org/10.26599/BDMA.2025.9020061

Received: 20 February 2025

Revised: 13 May 2025

Accepted: 16 May 2025

Published: 01 June 2026

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).