Bridging National and International Legal Data: Two Projects Based on the Japanese Legal Standard XML Schema for Comparative Law Studies

This paper presents an integrated framework for computational comparative law that connects Japanese statutes to international standards via a JLS-to-Akoma Ntoso conversion pipeline and employs multilingual semantic techniques to identify and visualize cross-jurisdictional legal correspondences.

Makoto Nakamura

Published 2026-03-17
📖 5 min read🧠 Deep dive

Imagine you are trying to compare the rulebooks of three different sports teams: one from Japan, one from France, and one from Germany. Each team writes their rules in a different language, uses different formatting, and organizes their chapters in unique ways. Trying to find out, "Do they all have the same rule about fouling a player?" is a nightmare for a human researcher. You'd need to be a polyglot expert who knows the history and culture of all three sports just to make sense of it.

This paper describes a project that builds a digital translator and a smart search engine to solve this problem, specifically for laws. The researchers, led by Makoto Nakamura, built a two-step system to connect legal data from Japan with the rest of the world.

Here is the breakdown of their "magic trick" using simple analogies:

The Big Problem: The "Tower of Babel" of Laws

Laws are like massive libraries. In Japan, the books are organized one way (using a system called JLS). In the rest of the world, most libraries use a different, international standard called Akoma Ntoso (AKN).

  • The Issue: Even if you speak the language, you can't easily compare the books because the "shelves" and "labels" don't match. A "Chapter 1" in Japan might look like "Section A" in France.
  • The Goal: Build a bridge so computers can read Japanese laws and compare them with French, German, or Korean laws automatically.

Step 1: The "Universal Adapter" (Project 1)

The Analogy: Think of Japanese laws as a phone charger with a unique Japanese plug. The rest of the world uses a standard international socket (AKN). You can't plug the Japanese charger into the world's power grid without an adapter.

What they did:
The team built a conversion pipeline (a digital adapter). They took the Japanese legal data and rewrote it so it fits the international standard.

  • They mapped every Japanese "Law," "Article," and "Paragraph" to its international equivalent.
  • The Result: Suddenly, Japanese laws look exactly like the laws of other countries to a computer. They are now "structurally interoperable." The Japanese library has been re-shelved to match the global library's layout.

Step 2: The "Smart Librarian" (Project 2)

The Analogy: Now that the books are on the same shelves, you still need to find the content that matches. Imagine you are looking for a specific recipe. You don't just look for the exact same words; you look for the same idea. Maybe one book says "Add a pinch of salt," and another says "Season lightly." A human knows these are the same. A computer usually needs help to realize that.

What they did:
They built a Semantic Search Engine using advanced AI (specifically, a type of "brain" called BERT).

  • The Brain: This AI doesn't just count words; it understands meaning. It knows that "unmarried fathers" in Japanese law and "fathers of children born out of wedlock" in French law are talking about the same concept, even though the words are totally different.
  • The Process:
    1. Vector Search: The AI turns every law article into a "mathematical point" in a giant 3D space. Laws with similar meanings end up close to each other, like friends standing in a circle.
    2. The Filter (FAISS): Since there are thousands of laws, the AI quickly scans the crowd to find the top candidates that are standing near the Japanese law.
    3. The Referee (Cross-Encoder): The AI takes a second look at the top candidates to make sure they are actually a good match, acting like a strict referee checking the rules.
    4. The Map: Finally, it draws a network map. Imagine a spiderweb where the Japanese law is in the center, and lines connect it to the matching laws in Korea and France.

Why This Matters

Before this, comparing laws was like trying to compare two different languages by hand, relying on a few experts who spoke both. It was slow, expensive, and limited.

This project creates a foundation for "Computational Comparative Law."

  • It's not replacing lawyers: It's not saying the AI is smarter than a human judge.
  • It's a super-tool: It acts like a high-powered telescope. It scans millions of laws instantly and says, "Hey, look! These three laws from three different countries seem to be talking about the same thing."
  • The Benefit: This helps researchers, policymakers, and lawyers spot patterns, understand how laws spread across borders, and find similarities they might have missed.

The Current Status

Think of this system as a prototype car. It's not a fully finished luxury vehicle yet, but it proves the engine works.

  • It successfully connected the "plugs" (Project 1).
  • It successfully found "meaningful matches" between Japanese, Korean, and French laws (Project 2).
  • Next Steps: The team needs to test it more, get human experts to verify the matches, and expand it to cover more countries and more types of laws.

In a nutshell: They built a universal translator for legal structures and a smart AI librarian that can find the "soul" of a law, regardless of what language it's written in. This turns the chaotic, isolated world of national laws into one giant, connected, searchable network.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →