Function2Scene 3D Indoor Scene Layout from Functional Specifications

A framework for generating 3D indoor layouts from functional specifications.

For the best experience, view the interactive viewer on a laptop, PC, or tablet — the full interface can be hard to navigate on a phone.

We present Function2Scene, a framework for generating 3D indoor layouts from functional specifications. Given a detailed functional specification, our method decomposes it into functional design constraints, which are then used to iteratively evaluate and refine a generated scene. Please refer to the paper and viewer for the full input prompt and more detailed visualizations.

Interactive Results

Interactive Demo

Loading demo...

Design Constraints

Constraints Taxonomy

The figures below are drawn from Julius Panero and Martin Zelnik's foundational reference work, Human Dimension and Interior Space: A Source Book of Design Reference Standards (Whitney Library of Design / Watson-Guptill Publications, New York, 1979). The book applies anthropometric data—measurements of the human body in motion and at rest—to the practical challenges of interior space planning. These four diagrams focus on residential and dining environments: the first two address lounge seating, illustrating the dimensional requirements for sofas accommodating two and three occupants, and the clearance zones needed around corner lounge arrangements to allow comfortable circulation. The latter two turn to dining and kitchen spaces, specifying the minimum clearances around a breakfast table for four, and the sitting, circulation, and standing activity zones required between a dining table and a buffet or wall. Together, they exemplify the book's core argument: that well-designed interior space is not a matter of taste alone, but of precise, human-centered measurement.

Sofa dimensions — **Fig. 1** Dimensional standards for two- and three-seat sofas, showing seat depth, hip width, and overall sofa length based on female occupant measurements.

Corner lounge clearance — **Fig. 2** Corner lounge seating arrangement with a 24-inch diagonal circulation zone, illustrating clearance requirements for an L-shaped configuration.

Breakfast table clearance — **Fig. 3** Minimum clearance and circulation requirements for a breakfast or kitchen table seating four, including place-setting and shared access zones.

Dining table to buffet clearance — **Fig. 4** Clearance zones between a dining table and buffet, defining the sitting zone, circulation path, and standing activity space required for comfortable use.

In practice, layout requirements are not universal. A mobility-limited user and a frequent entertainer sharing the same room type will impose fundamentally different demands on furniture placement, clearance, and zone allocation. We capture this variability through activity × persona combinations derived from the user's functional specifications, which together determine which constraints are relevant and how their thresholds are parameterized for a given scene.

We ground our constraints taxonomy in established interior design literature and organize constraints into four categories: Spatial, Ergonomic, Activity, and Environmental.

S1Geometry validity — every piece fits within the room boundary without overlapping neighbors, unless an explicit containment relationship is defined.
S2Boundary & attachment — floor items rest on the floor plane, wall-mounted objects attach to the correct surface, and large case goods align to the nearest wall.
S3Spatial relationships — functionally paired objects are placed in proximity with appropriate relative orientation.
S4Scale & proportion — furniture size is commensurate with room dimensions, preventing pieces from dominating or failing to define the space.
S5Visual composition — focal point orientation, visual balance, and alignment ensure the arrangement reads as intentional rather than arbitrary.

E1Circulation — primary pathways maintain a minimum clear width, with elevated thresholds for mobility aid users.
E2Interaction clearance — articulated elements have full action zones free of obstruction; seating has adequate pull-out space.
E3Reachability — frequently used objects fall within the user's operational height range, accounting for seated vs. standing use.
E4Body fit & posture — work surface heights, seat dimensions, and monitor distances conform to anthropometric standards.

A1Activity zone — each primary activity has a dedicated zone of sufficient size, equipped with relevant furniture.
A2Multi-activity compatibility — the layout supports zone transformation without requiring full reorganization.
A3Sightlines & privacy — tasks requiring surveillance get open sightlines; others get visual shielding.
A4Workflow sequencing — objects are arranged in order of use, avoiding backtracking and cross-circulation.

N1Natural light access — furniture does not block windows.
N2Glare prevention — screens are positioned to avoid direct glare from windows based on angle and distance.
N3Acoustic separation — sensitive zones are buffered from noise sources by distance or intervening furniture.
N4Ventilation & thermal — temperature-sensitive activities are kept away from drafts, heat sources, and obstructed vents.

Design Constraints

Evaluation Tools

To verify each constraint at runtime, our system employs 29 specialized tools across three types. Numeric and geometric tools compute quantitative measures directly from scene geometry — overlap ratios, traversal paths, clearance distances, and obstruction percentages. LLM query tools apply language model reasoning over structured scene data for contextual judgments that resist straightforward numeric formulation, such as workflow order, activity support, and glare risk. VLM tools interpret rendered images for holistic visual assessments, used here for top-down balance evaluation. Constraints are organized into six priority tiers (T1–T6): lower-tier constraints must be satisfied before higher-tier ones are evaluated, ensuring that geometric validity is confirmed before ergonomic fit, and ergonomic fit before activity-level behavior.

Numeric / geometric tool LLM query tool VLM tool T1 evaluated first → T6 last

Constraint	Tools & returns	Tier
Spatial
S1Geometry Validity	boundary_check()— Room containment (bool) bbox_collision()— Overlap ratio (%)	T1
S2Boundary & Attachment	contact_check()— Surface attachment (bool) wall_angle_check()— Wall angle (°)	T1
S3Spatial Relationships	object_exist()— Object presence (bool) object_info()— Position & dimensions (l,w,h,x,y,z)	T2
S4Scale & Proportion	size_ratio()— Room-relative size (%) size_check()— Size plausibility	T2
S5Visual Composition	visual_balance_check()— Top-down visual balance	T5
Ergonomic
E1Circulation	pathfinding()— Traversal path (waypoints) or null path_width()— Min clearance (m) & bottleneck position	T2
E2Interaction Clearance	articulation_zone()— Swing arc clearance (m) chair_clearance()— Chair pull-out clearance (m)	T4
E3Reachability	reach_check()— Reachability for user profile	T4
E4Body Fit & Posture	posture_check()— Posture suitability for user	T4
Activity
A1Activity Zone	free_floor_area()— Zone free area (m²) object_in_zone()— Object placement in zone activity_support_check()— Activity support quality	T3
A2Sightlines & Privacy	inbetween_check()— Sightline obstruction	T3
A3Workflow Sequencing	total_path_length()— Sequence path length (m) workflow_check()— Workflow order quality	T5
A4Multi-activity Compat.	multi_activity_check()— Simultaneous activity support	T5
Environmental
N1Natural Light Access	window_obs_ratio()— Window obstruction ratio (%)	T6
N2Glare Prevention	screen_window_info()— Screen–window angle & distance glare_check()— Glare risk	T5
N3Acoustic Separation	zone_distance()— Zone separation (m) acoustic_check()— Acoustic risk	T5
N4Ventilation & Thermal	vent_obs_ratio()— Vent obstruction ratio (%) distance_check()— Safe thermal distance (bool)	T6

Method

Pipeline

Given a functional prompt capturing the user's living needs, the system proceeds through two main stages. In the Initialization stage, the prompt is parsed into a structured constraint list and a reformulated scene description; the room structure is then generated and verified by the user before an initial furniture layout is produced. Because LLM-generated layouts at this stage frequently suffer from overlaps, violated adjacencies, and spatially plausible but practically unusable configurations, they serve as a starting point rather than a final result. The second stage, Constraints-based Evaluation and Refinement, addresses this by iteratively assessing the layout against all 17 constraints in priority order — invoking the specialized tools described above, generating a targeted adjustment for each unsatisfied constraint, and re-evaluating before moving on — until the layout satisfies functional, ergonomic, activity, and environmental requirements end-to-end.

Results

Quantitative Results

We evaluate Function2Scene through a two-alternative forced-choice (2AFC) perceptual study with 30 participants recruited via Prolific, comparing against Holodeck, iDesign, and LayoutVLM across 30 real interior design cases spanning 10 room types and 30 distinct occupant personas sourced from Architectural Digest. Participants were shown two rendered layouts side by side and asked to select the more functional one for the described occupant. Across all baselines and prompt conditions, participants preferred Function2Scene at an aggregate rate of 94.3%, with the margin reaching as high as 98.9% against iDesign under our parsed prompt. An ablation study further identifies the evaluation tool set as the critical enabler: applying iterative refinement without tools performs worse than applying no refinement at all, confirming that grounded spatial feedback, not iteration alone, drives the improvement.

2AFC comparison with baselines

% of trials where participants preferred our method

Method	Prompt	% Preferred
Holodeck	Functional	92.2
	Parsed	88.9
iDesign	Functional	94.4
	Parsed	98.9
LayoutVLM	Functional	96.7
	Parsed	94.4
Overall	—	94.3

Ablation study

% of trials where participants preferred our full method over each ablation

Prompt	Iterative	Tools	% Preferred
Functional	✗	✗	83.3
Parsed	✗	✗	83.3
Functional	✓	✗	78.9
Parsed	✓	✗	80.0
Parsed	✓	✓	Function2Scene

Results

Qualitative Results

We invite readers to explore the full set of 30 generated scenes in our interactive viewer, where each layout can be inspected from any angle alongside its parsed prompt and constraint list.

Open Viewer