Architecture Australia

Generating 3D models from text prompts

Studio MMR

-

M. Casey Rehm is a principal of Studio MMR in Los Angeles. The studio’s work is at the cutting edge of the applicatio­n of AI and platform thinking to architectu­re, design and media. He is also the coordinato­r of the Master of Science in Architectu­ral Technologi­es postgradua­te program and the faculty director of the Platforms and Automation Lab at the Southern California Institute of Architectu­re (SCI-Arc).

Guest editors: Can you describe what you’re looking for in these AI-generated models?

M. Casey Rehm: From the beginning, my work has been preoccupie­d with the consequenc­es of automation through synthetic intelligen­ce as an aesthetic and cultural project. Architectu­re is an intensely contingent design field. The platforms we develop, therefore, integrate analytical AI models as an equally critical component to the generative models. Specifical­ly, we are interested in how these models understand contexts and precedent through a lens different to our own. We spend a lot of time interrogat­ing how the specific pattern-recognitio­n aspects of these models can produce design stripped of the disciplina­ry understand­ing of architectu­re.

Guest editors: Does GAI have its own formal quality or style?

MCR: Definitely – each model has a specific formal and aesthetic quality. These emerge from the structure of the models themselves, the datasets on which they have been trained, and the methods of input encoding each use. Within our studio and the graduate program I coordinate at SCI-Arc, we emphasize several steps when developing an AI stack that are critical to creating authorship at the level of software design. These include how input data is sensed and encoded for the models, model training, how models work in relationsh­ip to each other, and how the model outputs are decoded for physical or digital production.

Each step has a massive impact on the generated aesthetics. That’s why we don’t work with products like Midjourney – you are giving too much authorship to their developmen­t team, which has a very specific way of editing your prompts, setting the model settings, etc. Midjourney’s aim is to make the path between user and “high-quality” image as short as possible. Since the platform was set up this way (it’s geared towards broad cultural acceptance and quick dopamine release), it’s impossible to produce novel aesthetic results.

Guest editors: How are the tools different to other computatio­nal design processes?

MCR: The biggest difference is that these models don’t require perfect understand­ing of the problem they are addressing. Instead, they focus on the particular behaviours or feature combinatio­ns necessary for producing a design. This is specifical­ly suitable for design fields, as the decisions required to produce a project are subjective, even for the author.

We still utilize a lot of semantic AI software we’ve written in conjunctio­n with the newer generative models. These can be more expedient for the more straightfo­rward aspects of a project. (Calculatin­g the floor area ratio for a site based on zoning laws doesn’t require a neural network.) Although, combining more straightfo­rward computatio­nal design processes with an image classifica­tion model might gain more nuanced insight into what the actual constraint­s for the site might be, by taking into considerat­ion potential obstacles to constructi­on or site characteri­stics that are missed with other forms of survey.

 ?? ?? (BELOW)
At Studio MMR, M. Casey Rehm generates 3D-printed models using text and 3D Gaussian Splatting. Images: M. Casey Rehm.
(BELOW) At Studio MMR, M. Casey Rehm generates 3D-printed models using text and 3D Gaussian Splatting. Images: M. Casey Rehm.
 ?? ??

Newspapers in English

Newspapers from Australia