The figures below are drawn from Julius Panero and Martin Zelnik's foundational reference work, Human Dimension and Interior Space: A Source Book of Design Reference Standards (Whitney Library of Design / Watson-Guptill Publications, New York, 1979). The book applies anthropometric data—measurements of the human body in motion and at rest—to the practical challenges of interior space planning. These four diagrams focus on residential and dining environments: the first two address lounge seating, illustrating the dimensional requirements for sofas accommodating two and three occupants, and the clearance zones needed around corner lounge arrangements to allow comfortable circulation. The latter two turn to dining and kitchen spaces, specifying the minimum clearances around a breakfast table for four, and the sitting, circulation, and standing activity zones required between a dining table and a buffet or wall. Together, they exemplify the book's core argument: that well-designed interior space is not a matter of taste alone, but of precise, human-centered measurement.
In practice, layout requirements are not universal. A mobility-limited user and a frequent entertainer sharing the same room type will impose fundamentally different demands on furniture placement, clearance, and zone allocation. We capture this variability through activity × persona combinations derived from the user's functional specifications, which together determine which constraints are relevant and how their thresholds are parameterized for a given scene.
We ground our constraints taxonomy in established interior design literature and organize constraints into four categories: Spatial, Ergonomic, Activity, and Environmental.
To verify each constraint at runtime, our system employs 29 specialized tools across three types. Numeric and geometric tools compute quantitative measures directly from scene geometry — overlap ratios, traversal paths, clearance distances, and obstruction percentages. LLM query tools apply language model reasoning over structured scene data for contextual judgments that resist straightforward numeric formulation, such as workflow order, activity support, and glare risk. VLM tools interpret rendered images for holistic visual assessments, used here for top-down balance evaluation. Constraints are organized into six priority tiers (T1–T6): lower-tier constraints must be satisfied before higher-tier ones are evaluated, ensuring that geometric validity is confirmed before ergonomic fit, and ergonomic fit before activity-level behavior.
| Constraint | Tools & returns | Tier |
|---|---|---|
| Spatial | ||
S1Geometry Validity |
boundary_check()— Room containment (bool)
bbox_collision()— Overlap ratio (%)
|
T1 |
S2Boundary & Attachment |
contact_check()— Surface attachment (bool)
wall_angle_check()— Wall angle (°)
|
T1 |
S3Spatial Relationships |
object_exist()— Object presence (bool)
object_info()— Position & dimensions (l,w,h,x,y,z)
|
T2 |
S4Scale & Proportion |
size_ratio()— Room-relative size (%)
size_check()— Size plausibility
|
T2 |
S5Visual Composition |
visual_balance_check()— Top-down visual balance
|
T5 |
| Ergonomic | ||
E1Circulation |
pathfinding()— Traversal path (waypoints) or null
path_width()— Min clearance (m) & bottleneck position
|
T2 |
E2Interaction Clearance |
articulation_zone()— Swing arc clearance (m)
chair_clearance()— Chair pull-out clearance (m)
|
T4 |
E3Reachability |
reach_check()— Reachability for user profile
|
T4 |
E4Body Fit & Posture |
posture_check()— Posture suitability for user
|
T4 |
| Activity | ||
A1Activity Zone |
free_floor_area()— Zone free area (m²)
object_in_zone()— Object placement in zone
activity_support_check()— Activity support quality
|
T3 |
A2Sightlines & Privacy |
inbetween_check()— Sightline obstruction
|
T3 |
A3Workflow Sequencing |
total_path_length()— Sequence path length (m)
workflow_check()— Workflow order quality
|
T5 |
A4Multi-activity Compat. |
multi_activity_check()— Simultaneous activity support
|
T5 |
| Environmental | ||
N1Natural Light Access |
window_obs_ratio()— Window obstruction ratio (%)
|
T6 |
N2Glare Prevention |
screen_window_info()— Screen–window angle & distance
glare_check()— Glare risk
|
T5 |
N3Acoustic Separation |
zone_distance()— Zone separation (m)
acoustic_check()— Acoustic risk
|
T5 |
N4Ventilation & Thermal |
vent_obs_ratio()— Vent obstruction ratio (%)
distance_check()— Safe thermal distance (bool)
|
T6 |
Given a functional prompt capturing the user's living needs, the system proceeds through two main stages. In the Initialization stage, the prompt is parsed into a structured constraint list and a reformulated scene description; the room structure is then generated and verified by the user before an initial furniture layout is produced. Because LLM-generated layouts at this stage frequently suffer from overlaps, violated adjacencies, and spatially plausible but practically unusable configurations, they serve as a starting point rather than a final result. The second stage, Constraints-based Evaluation and Refinement, addresses this by iteratively assessing the layout against all 17 constraints in priority order — invoking the specialized tools described above, generating a targeted adjustment for each unsatisfied constraint, and re-evaluating before moving on — until the layout satisfies functional, ergonomic, activity, and environmental requirements end-to-end.
We evaluate Function2Scene through a two-alternative forced-choice (2AFC) perceptual study with 30 participants recruited via Prolific, comparing against Holodeck, iDesign, and LayoutVLM across 30 real interior design cases spanning 10 room types and 30 distinct occupant personas sourced from Architectural Digest. Participants were shown two rendered layouts side by side and asked to select the more functional one for the described occupant. Across all baselines and prompt conditions, participants preferred Function2Scene at an aggregate rate of 94.3%, with the margin reaching as high as 98.9% against iDesign under our parsed prompt. An ablation study further identifies the evaluation tool set as the critical enabler: applying iterative refinement without tools performs worse than applying no refinement at all, confirming that grounded spatial feedback, not iteration alone, drives the improvement.
| Method | Prompt | % Preferred |
|---|---|---|
| Holodeck | Functional | 92.2 |
| Parsed | 88.9 | |
| iDesign | Functional | 94.4 |
| Parsed | 98.9 | |
| LayoutVLM | Functional | 96.7 |
| Parsed | 94.4 | |
| Overall | — | 94.3 |
| Prompt | Iterative | Tools | % Preferred |
|---|---|---|---|
| Functional | ✗ | ✗ | 83.3 |
| Parsed | ✗ | ✗ | 83.3 |
| Functional | ✓ | ✗ | 78.9 |
| Parsed | ✓ | ✗ | 80.0 |
| Parsed | ✓ | ✓ | Function2Scene |
We invite readers to explore the full set of 30 generated scenes in our interactive viewer, where each layout can be inspected from any angle alongside its parsed prompt and constraint list.