GRIHA: Synthesizing 2-Dimensional Building Layouts from Images Captured using a Smart Phone

The paper proposes GRIHA, a fast and efficient framework that utilizes SLAM technology embedded in standard mobile ARCore libraries to generate 2D building layouts from simple RGB smartphone images, overcoming the hardware and occlusion limitations of existing methods.

Shreya Goyal, Naimul Khan, Chiranjoy Chattopadhyay, Gaurav Bhatnagar

Published 2026-02-20
📖 5 min read🧠 Deep dive

Imagine you are a detective trying to solve a mystery: "What does this room actually look like on paper?"

Usually, to draw a floor plan (a 2D map of a room), you have to do one of two tedious things:

  1. The Old School Way: Walk around with a tape measure, write down every inch, and then spend hours drawing it on a computer.
  2. The High-Tech Way: Use a special, expensive camera that can "see" depth (like a robot's eyes) or take a perfect, 360-degree panoramic photo where nothing blocks the view.

The Problem: Most of us don't have robot eyes or expensive cameras. And if you try to take a 360-degree photo in a messy room full of furniture, the sofa blocks the wall, the table hides the corner, and the photo becomes useless.

The Solution: Enter GRIHA (pronounced like "Gree-ha," meaning "Home" in Sanskrit). It's a new app that lets you create a perfect floor plan using just a regular smartphone and a few quick photos.

Here is how GRIHA works, broken down into simple analogies:

1. The "Magic Glasses" (ARCore & SLAM)

Imagine you are wearing a pair of magic glasses that know exactly where you are standing in the room, even if you are just walking around.

  • How it works: GRIHA uses a technology built into modern Android phones called ARCore. It's like a GPS for inside a building. As you walk and take photos, the phone tracks your movement (how far you walked, how much you turned) using its internal sensors. It builds a "mental map" of your path without needing any special hardware.

2. The "Imagination Engine" (Depth Estimation)

A normal phone camera takes a flat, 2D picture. It doesn't know how far away the wall is.

  • The Analogy: Imagine looking at a painting of a mountain. A normal eye sees a flat image. But GRIHA has a super-powered AI brain that looks at the flat photo and says, "I know this looks flat, but based on the shadows and textures, I can imagine how deep this room is."
  • The Tech: It uses a pre-trained AI (a neural network) to guess the "depth map" of the room. It turns a flat photo into a 3D mental model, estimating how far the furniture is from the wall.

3. The "Puzzle Solver" (3D Reconstruction)

Now, the app has a bunch of flat photos and a guess about how deep they are. It needs to build a 3D model.

  • The Analogy: Think of it like building a 3D sculpture out of clay. You take a photo of a corner, and the app sculpts a little piece of 3D clay for that corner. You take a photo of the door, it sculpts the door. It does this for every photo you take.
  • The Twist: Because you took photos from different angles, the app uses your walking path (from Step 1) to stitch these 3D clay pieces together into one big, floating 3D model of the room.

4. The "Straightener" (Regularization)

Real life is messy. Walls might look slightly crooked in a photo because of the camera angle. But in the real world, walls are usually straight and meet at perfect 90-degree angles (like a grid).

  • The Analogy: Imagine you drew a room on a piece of paper, but your hand shook, so the walls are wobbly. GRIHA is like a magical ruler that comes in, grabs the wobbly lines, and snaps them perfectly straight. It forces the walls to be straight and the corners to be perfect right angles, assuming the room follows "Manhattan rules" (like a city grid).

5. The "Door Detective" (Door Placement)

Finally, the app looks at your photos to find the doors.

  • The Analogy: It's like a game of "Where's Waldo?" but for doors. It scans the images, spots the door, and then calculates exactly where that door should go on your new, straightened floor plan. It places the door symbol in the exact right spot relative to the walls.

Why is this a Big Deal?

  • No Special Gear: You don't need a $2,000 depth camera. Your old smartphone works.
  • Handles Messy Rooms: If you have a couch blocking a corner, other apps fail because they can't "see" the wall. GRIHA takes a photo of the other side of the room, figures out the geometry, and fills in the missing piece. It's like solving a puzzle even when some pieces are hidden.
  • Super Fast: Instead of walking around the whole room scanning everything (which takes 10 minutes), you just snap 4 photos (one from each corner) and the app does the rest in seconds.

The Result

The paper tested this on real offices and labs. They found that GRIHA was just as accurate as the expensive apps that require you to walk around and manually measure everything, but it was much easier to use and didn't get confused by furniture blocking the view.

In short: GRIHA turns your smartphone into a magic architect that can draw your house's blueprint just by looking at a few snapshots.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →