Challenges in Android Data Disclosure: An Empirical Study

Through a survey and analysis of online discussions involving 683 developers, this paper identifies that while Android developers are confident in knowing what data their apps collect, they struggle to accurately categorize and disclose it within Google Play’s Data Safety Section due to complex requirements, classification difficulties, and fear of app rejection.

Original authors: Mugdha Khedkar, Michael Schlichtig, Mohamed Soliman, Eric Bodden

Published 2026-04-27
📖 3 min read☕ Coffee break read

This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are a chef opening a new restaurant. To keep customers safe, the government requires you to post a "Nutrition & Ingredient Label" on your menu. This label must tell customers exactly what’s in the food: Is there peanuts? Is there gluten? Is it organic?

The problem? The government’s instructions for the label are written in confusing legal jargon, and they don't tell you if the spices used by your vegetable supplier (your "third-party suppliers") contain allergens. If you make a mistake, they might shut your restaurant down immediately.

This research paper is a study of exactly that struggle, but for Android app developers.

The Core Problem: The "Data Safety" Label

When you download an app on your phone, you often see a "Data Safety" section. This is a digital nutrition label that tells you, "This app collects your location, your email, and your contacts."

Google (the "government" in this analogy) requires every developer to fill out a form to create these labels. If the label is wrong, Google can kick the app off the Play Store.

What the Researchers Found

The researchers talked to hundreds of developers to see how they handle this "labeling" task. Here is what they discovered, explained through metaphors:

1. The "Guesswork" Method (Manual Classification)
Instead of having a high-tech scanner that tells them exactly what’s in their "food," most developers are just looking at their ingredients and guessing.

  • The Analogy: It’s like a chef trying to guess if a pre-made sauce has salt in it just by looking at it. Some developers don't even bother labeling certain ingredients because they think, "It's just a simple salad, it doesn't need a label," even though the dressing might contain hidden data.

2. The "Confidence Gap"
The researchers found a strange psychological phenomenon: Developers are very confident they know what their app does, but they are terrified of the labeling process itself.

  • The Analogy: It’s like a chef who knows exactly how they cooked the steak, but becomes incredibly nervous the moment they have to write it down on the official government form because they are afraid of using the wrong technical term and getting fined.

3. The "Mystery Ingredient" Problem (Third-Party Libraries)
This was the biggest headache. Most apps aren't built from scratch; they use "pre-made ingredients" called SDKs (software kits) provided by other companies (like Google Maps or AdMob for ads).

  • The Analogy: Imagine you are making a soup, but you bought a "Secret Spice Mix" from a different company. You don't know exactly what's in that mix. If that spice mix contains peanuts, you are the one responsible for telling the customer, even though you didn't put the peanuts in the kitchen yourself. Developers are struggling to report data that they didn't even collect themselves!

4. The "Fear of the Inspector" (App Rejections)
Developers are living in fear of "App Rejection."

  • The Analogy: They are constantly worried that a "Health Inspector" (Google) will walk in, find a tiny discrepancy between the menu label and the actual kitchen practices, and instantly padlock their doors.

The Bottom Line

The paper concludes that the current system is a bit of a mess. Developers aren't necessarily trying to lie to you; they are just overwhelmed by confusing rules, "mystery ingredients" from third parties, and a lack of good tools to help them do the job right.

The researchers' suggestion? We need better "digital kitchen scales" (automated tools) and much clearer "recipe books" (guidelines) so that the labels you see on your phone are actually accurate and trustworthy.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →