Methodology · How LiveArt's AI Works

1.Overview

This document describes the methodologies behind LiveArt's quantitative and qualitative outputs. It begins with the LiveArt Estimate™ (LAE) — our core valuation model — and then covers the analytics built on top of it: the Historical View, the forward-looking Forward View, the Outlook that combines the two, price momentum, similarity, and the market indices.

Our stance is the opposite of black-box AI: the methods are published, the limitations are stated, and every estimate carries a confidence range. In a market where any valuation can be scrutinised by a specialist, a client, or a court, we believe transparency is the only responsible approach.

2.LiveArt Estimate (LAE)

LiveArt Estimate™ answers one question — what is this artwork worth right now? — by separating it into three distinct sub-questions, each answered by a different model. The three components compose into a single price with a confidence range.

Base price

COMPONENT 1 · BASE · XGBoost regression

The intrinsic value of an artwork independent of market timing. Trained on 100+ features per artwork: artist, medium, size, period, provenance, and more.

Artwork-specific trend

COMPONENT 2 · ARTWORK · Gradient-boosted decision trees

Captures how specific work categories — by artist, medium, period, size — drift relative to the overall market. Tree-based models capture interaction effects a single regression would miss.

Market trend

COMPONENT 3 · MARKET · Repeat-sales regression

Tracks pure market cycles by analyzing works that have sold more than once. Differencing pairs of sales isolates market movement from artwork-specific drift.

One artwork. One estimate.

For any artwork at any point in time, the final LAE is the sum of these three components. The result is a current estimated price with a confidence range — and, because the components are time-aware, an estimate for any historical point.

Recency and model versioning

Every LiveArt Estimate is dated and versioned. Models are retrained on a recurring cadence as new auction results settle, and each estimate carries the timestamp of its most recent computation together with the identifier of the model that produced it. Where that provenance is available, we surface it alongside the figure, so a valuation always reads as a dated model output rather than a static number.

3.Why three models

The obvious question is why not use one big model. The answer comes down to three properties of the art market that a single model handles poorly.

Data is sparse per artwork

Most artworks transact once or not at all. A model that needs many observations per artwork has nothing to learn from. Pairs analysis on repeat sales sidesteps this.

Quality is unobserved

Artist significance, retrospective inclusion, condition nuance — these matter to price but aren't in the data. Differencing techniques cancel out these latent factors when comparing repeat sales.

Interactions dominate

A Basquiat painting and a Basquiat drawing don't trend together. A linear model can't capture that. Tree-based models can — but only if the question they're solving is narrow enough.

Signal leak — where one model's output contaminates another's training data — is the most common failure mode in art-market AI. Separating questions enforces clean inputs.

4.Historical estimates

Because LAE is built on a time-aware architecture — the three components each carry time information — the model can produce an estimated price for any artwork at any point in its history. That makes portfolio analytics possible.

PRO FORMA RETURNS — Returns on hypothetical holding periods. Pick any two dates, get a return — at the artwork, artist, or portfolio level.
INDEX BENCHMARKS — Compare against LiveArt indices, blue-chip cohorts, or traditional benchmarks (S&P 500, bonds, gold). Same currency, same period.
PORTFOLIO ANALYTICS — Sharpe ratios, drawdowns, correlation matrices, volatility — derived from continuous price series, not just confirmed sales.

Compound Annual Return (CAR)

Where we report year-by-year returns — such as an artist’s Annual Returns — we use the Compound Annual Return (CAR): the annualized, compounded rate of return realized across repeat sales of the same work. CAR is the same calculation as the compound annual growth rate (CAGR); we say “return” because it is measured from actual buy–sell pairs. Because it compounds rather than averaging each year’s change, it neither overstates nor understates performance across volatile years.

5.Historical View

The Historical View turns market signals into an underwriter-facing read — accumulate, hold, sell, or watch — with a composite 0–100 score, a market-strength gauge, a confidence level, and a list of plain-language reasons. It is deliberately deterministic and rules-based: every threshold is published below, so any output can be reconstructed from its inputs. It is decision support, not investment advice.

Inputs

For an artist — or, for a held work, that work's artist cohort — the score reads a five-year window of auction activity together with the artist's trailing momentum: sell-through rate, buy-in rate, the split of lots selling above / within / below estimate, the sample size (lots offered), and 12-month price momentum. At the holding level it also reads compound annual return and the position's weight in the portfolio. Missing inputs lower the confidence rather than breaking the score.

Market strength (0–100)

The demand-and-liquidity axis is a weighted blend of whatever signals are present, re-normalised by the weights that actually fire: sell-through (weight 0.5, mapped 50%→90%), share sold above estimate (0.4, mapped 20%→60%), a below-estimate penalty (0.2), and a buy-in penalty (0.3, mapped 10%→40%). The question it answers: can this work be sold, and is demand beating the trade's own pricing?

Trend

Twelve-month momentum sets the trend: rising above +5%, falling below −5%, otherwise flat. A separate “peaking” flag fires above +25% — the recent-run-up-equals-correction-risk signal from the art-finance literature.

The signal

Market strength is crossed with trend, deliberately, so that “sell” and “watch” are not the same recommendation:

Weak market (below 45) → Watch — illiquid; exit is hard, so monitor rather than act.
Strong market (65+), peaking or falling → Sell — realise value while the work is still liquid.
Strong market, rising → Accumulate.
Strong market, flat → Hold.
Moderate market (45–64): falling → Watch (don't sell into weakness); peaking → Sell; otherwise → Hold.

The crucial distinction: a strong but falling market says Sell (exit while you can); a moderate, falling market says Watch (selling now would be into weakness). That is why a deep, liquid name with flat momentum reads Hold, while a thinner one with falling momentum reads Watch.

Composite score & confidence

The 0–100 composite used to rank holdings is an equal blend of market strength and a momentum-derived trend score (50 at flat, 100 at +20%, 0 at −20%), less a penalty for positions above 25% of portfolio value. Confidence is High when market data and momentum are both present over at least 30 lots, Medium above 10 lots, otherwise Low.

Reasons & auditability

Every rule that fires emits a structured reason — a code, a severity, and the value that triggered it — rendered in plain language and sorted so cautions appear first. Because the thresholds are fixed and published, the “why” is always reconstructable from the inputs; nothing is generated by a language model. An optional AI-written narrative may be layered on top in future, but never in place of the rules.

Limitations

The thresholds are a defensible first pass, not yet back-tested against realised sale outcomes. The artist-level read uses the artist's whole market; the per-work and per-holding versions narrow to the work's cohort and add return and concentration. The Historical View is a market read for professionals, not personalised investment advice.

6.Forward View

The Historical View reads the past — completed auction sales. The Forward View is its forward-looking complement: a structured, cited, confidence-scored reading of the forces that move an artist's market before they reach the sale room. It is organised as a bull-case versus bear-case ledger — the art-market equivalent of an equity research note — and it is advisory: it informs the top-line read without overriding the deterministic quant. Every signal carries its sources, so any claim can be checked.

The signal — the unit of analysis

We do not publish opinions in prose. The unit is a structured signal: a single sourced fact about an artist's market, tagged with its direction (bull or bear), how material it is to price, the horizon over which it plays out, a confidence level, how much of it is already reflected in past sale prices, how fresh the evidence is, and a one-line rationale linking the fact to value. A fact that does not map to a direction, a magnitude, and a horizon is treated as context, not signal. The result is a ledger in which every line can be audited on its own.

Two kinds of signal: directional and structural

Signals fall into two families that are scored separately and never mixed. Directional signals move the bull/bear balance — the view on where price is heading. Structural signals describe price-discovery quality — whether the marked price can be trusted at all — and they only ever lower it; a favourable market structure is never counted as a reason to be bullish. The separation is deliberate. Illiquidity, concentrated ownership, or a contested authority are reasons to trust an observed price less, not reasons to be bearish on the artist — and conflating the two is the most common way an art-market model misleads.

Directional axes — the price-direction view

Twelve axes carry the directional view. Each is bidirectional: depending on the evidence, the same axis can post a bull or a bear signal.

Institutional validation — Major museum exhibitions, acquisitions, and biennial selection — the slow, durable signal that an artist is entering the canon.
Gallery representation — The tier of gallery placing the work and any recent upgrade or downgrade in representation — a forward read on primary-market support, for living artists and active estates.
Supply & estate — Estate discipline, foundation oversight, and catalogue raisonné status, weighed against supply overhang or estate dumping.
Authentication & title — Whether a work is authenticated, catalogued, and free of attribution, title, or restitution disputes.
Narrative momentum — Shifts in critical and media attention — rediscovery and of-the-moment heat versus fading coverage.
Speculation risk — Signs of overheating, flipping, and unsustainable run-ups that historically precede corrections.
Provenance — Distinguished, single-owner, or celebrity ownership history versus thin, murky, or gapped provenance.
Primary–secondary gap — The relationship between primary-market asking prices and secondary-market results — a read on genuine underlying demand.
Critical & scholarly standing — Monographs, serious scholarship, and canonical inclusion — the academic foundation beneath a market.
Awards & honours — Major prizes, national-pavilion selection, and formal recognition.
Reputational risk — Scandal, legal exposure, or controversy capable of moving a market sharply.
Zeitgeist & cohort — Whether the artist sits in a category with structural tailwinds or headwinds — rediscovered women, regional cohorts, out-of-favour movements.

Structural axes — price-discovery quality

Eighteen axes describe ways the observed price itself becomes less trustworthy. They are impairments only — each can lower price-discovery quality but never raise it. They were identified one market archetype at a time, and a candidate is adopted only when it is a genuinely distinct mechanism that generalises beyond a single artist.

Ownership concentration — A few holders control and defend supply, so observed prices may reflect a managed float rather than a broad market.
Authentication regime — A contested, absent, or punitive authentication authority where the certifier's identity is itself price-determining.
Market maturity — A young, thin, or volatile secondary market formed inside a single speculative window, where comparables are unreliable.
Oeuvre heterogeneity — Output that splits into distinct media or series sub-markets, so an artist-level average misleads on any single work.
Liquidity float — Canonical works are largely museum-locked, leaving a near-zero private float and decades-stale comparables.
Geographic concentration — Demand and sale venues concentrated in one region, adding currency, regulatory, and regional-cycle risk and a narrow bidder pool.
Settlement reliability — Reported hammer prices that may not correspond to settled, paid transactions, so headline results overstate liquid value.
Brand & commercial dependence — A price level underwritten by an active commercial brand or hype engine rather than autonomous demand.
Ideological provenance — Documented politics or history that permanently caps the buyer and display pool — a one-directional ceiling.
Conservation fragility — Perishable, organic, or kinetic works that decay and raise a replacement-identity question when parts are remade.
Attribution stability — An accepted corpus that itself shifts as scholarship and connoisseurship evolve, moving status and value by orders of magnitude.
Mortality overhang — A known-ill or elderly living artist's supply-discontinuity uncertainty — distinct from the death event itself.
Site-specificity & title — Works fused to non-owned property, where title is contested and value can be destroyed on removal.
Signature-motif dependence — A market resting on one endlessly-repeated icon, raising fungibility and self-dilution risk.
Authentication-destruction risk — An authority that can physically seize or destroy a rejected work, not merely withhold a certificate.
Cast & edition authenticity — In reproducible cast or edition media, multiple legitimate originals coexist on a lifetime-to-posthumous hierarchy, so which cast dominates value.
Diminished-capacity authorship — Genuine doubt about the artist's agency at the time of making, splitting an authentic-by-hand body into contested tiers.
Digital-custody dependence — For digital-native work, value resting on off-the-artwork substrates — token binding, key custody, hosting, and file permanence.

First-party market signals

Alongside the AI-gathered forward signals, the bull/bear case surfaces LiveArt's own market-structure indicators — the share of lots sold above or below estimate, the bought-in rate, sell-through, 12-month momentum and long-run CAGR — drawn directly from our settled-sales record over a five-year window. They appear as primary-source cards, badged "LiveArt market data", interleaved into and leading each bull/bear column. They are evidence only: because these same metrics already feed the quantitative disposition that anchors the Outlook, they are deliberately excluded from the Forward View tilt to avoid double-counting.

Price-discovery quality

The structural impairments compound: each independent impairment multiplies the others down, so several mild impairments together can pull an artist's price-discovery quality well below half even when no single one is disqualifying. This one reading, expressed as a percentage, becomes the Structural Integrity dimension of the top-line below. A low reading is itself the flag — it tells you to weight the observed price, and any trend drawn from it, accordingly.

Anti-double-counting, confidence, and honesty

Two disciplines keep the layer additive and honest. First, a signal already absorbed into past sale prices is discounted, so the layer contributes only what the backward-looking quant has not already priced in — forward, not-yet-realised catalysts carry the most weight. Second, low-confidence signals are shown but excluded from scoring, and a catalyst whose date has passed is dropped from the forward view until its outcome is confirmed. Monetary figures appear only when sourced; unproven claims are explicitly hedged; market manipulation is never asserted as fact.

7.Outlook

The Outlook is the single top-line read. It fuses the backward-looking Historical View with the forward-looking Forward View into three independent dimensions — Direction, Conviction, and Structural Integrity — plus a one-line plain-language summary. The three are reported separately on purpose: a call can be clear in direction yet low in conviction, or high in conviction yet hard to act on. Collapsing them into one number would hide exactly the distinctions a professional needs.

Direction — Accumulate · Hold · Reduce · Monitor

Direction blends the quant score with the net Forward View tilt into a single reading, then bands it:

Accumulate — the weight of evidence favours adding to a position.
Hold — the read is balanced or stable; no action is indicated.
Reduce — the weight of evidence favours trimming.
Monitor — conviction is too low to take a position; shown with the direction it leans, this is the honest “not yet known.”

These four labels are the fused top-line. They take the Historical View's own accumulate / hold / sell / watch as one of their inputs, alongside the Forward View tilt — so the top-line can lean against the raw quant signal when the forward evidence warrants it. On a single artwork the same Direction is expressed in the single-asset register — Buy / Hold / Sell — since you can't accumulate or reduce one object.

Conviction — High · Moderate · Low

Conviction is how much trust to place in the Direction. It blends two things: Evidence — how deep and reliable the underlying market data is — and Agreement — how closely the backward-looking and forward-looking reads point the same way. We combine them so a shortfall in either tempers the result, and we surface disagreement rather than averaging it away: when the past and the forward view conflict, conviction falls even if each read is individually clear. Low conviction is what turns a Direction into a Monitor.

Structural Integrity — High · Moderate · Low

Structural Integrity asks a different question: can you actually act on the call? It is the price-discovery-quality reading from the qualitative layer, surfaced as its own dimension and deliberately kept out of Conviction. A museum-locked masterpiece can be a perfectly high-conviction Accumulate that is nonetheless hard to execute; folding executability into conviction would wrongly mute the view. When structural integrity is impaired, the top-line names the reason in plain language — museum-locked float, concentrated ownership, contested authentication, fragmented sub-markets, and the like.

The summary line and divergence

The three dimensions resolve into one sentence — for example, “Hold · Moderate conviction · limited structural integrity (museum-locked float).” Where the forward qualitative read disagrees with the backward quant read, the divergence is shown explicitly as the headline insight, because a forward signal that contradicts the price chart is often the most decision-useful output the system produces.

When there is no qualitative ledger

Most artists do not yet carry a qualitative ledger. For them the top-line reads off the Historical View alone — no qualitative adjustment, full structural integrity assumed, conviction set by the strength of the market data — with no false precision added. The qualitative layer only ever sharpens a read where genuine, sourced signals exist.

Like the Historical View, the Outlook is decision support for professionals — a structured market read, not personalised investment advice or a price target.

8.Price momentum

Repeat-sales-filtered 12-month signal at the artist or category level.

Real-time structured signals from auction calendars, results, and corrections.

9.Similarity & embeddings

64-dimensional vectors enabling similarity comparison and clustering across 350K+ artists.

Comparable-artwork retrieval based on visual and metadata features — the workhorse behind cataloguing and search.

Cataloguing workflows: artist attribution, medium detection, edition matching from photos.

10.Indices

LiveArt indices track a basket of artists as a single, continuously-revalued series. The flagship LiveArt 100 (LA100) follows the 100 highest-market-cap artists; planned sub-indices apply the same machinery to a segment, cohort, or price tier. Each member contributes the monthly, normalized artist index described above, aggregated into one index level.

Construction

An index level is the chain-linked, weighted average of its members’ monthly returns, compounded onto a running level and rebased to the family base described below. Because members re-price at different times — some after every sale, others only once a year — each is carried at its most recent value every month; a member with no new sale that month simply holds flat (a 0% return at its full weight, never dropped or renormalized away). This marks the basket to last price, so a single fresh sale in an otherwise quiet month cannot dominate the index and a data gap never injects a false zero return.

Segment & cohort indices

Beyond the flagship LA100, the family includes curated segment and cohort indices (for example, regional, gender, and category baskets). Because the data has no reliable server-side filter for these segments, membership is a fixed, multi-source-sourced allowlist of artists — drawn from auction-house specialist departments, museum and reference canons, and non-Western market coverage — ranked to the top N of that canon by market capitalization and reconstituted annually. The same chain-linked, capped-market-cap engine applies. Each segment index uses a single-name cap chosen for its universe shape and states, in its own methodology, whether it meets the UCITS 5/10/40 diversification standard or is shown as indicative; it also notes any constituents the model cannot yet capture (for instance, artists whose market is primarily domestic rather than international auction flow).

Weighting

Members are weighted by market capitalization, subject to a single-name cap — 10% for the LiveArt 100 — applied iteratively with the excess redistributed pro-rata across uncapped members. Capping prevents any one artist from dominating the basket while preserving the economic ordering of the rest.

Market capitalization

Each constituent's market capitalization is the aggregate LiveArt Estimate (LAE) of every work by that artist that has ever sold at public auction, valued as of today; an index's market cap is the sum across its constituents, and is also the denominator of the market-cap weighting. It is a model-derived valuation — not a float of tradable shares — and it revises as new auction results update the underlying LAE. By construction it captures only the publicly-auctioned body of work: pieces sold solely in private transactions, or held in private collections or institutions and never offered at auction, are not represented.

Base

Every LiveArt index is rebased to 1000 at January 2000 — the same convention governs the per-artist artist index and every basket built from it, so absolute levels are directly comparable across the whole family. An index whose history begins after January 2000 is based at 1000 on its first available month and is comparable from that inception forward. A series' base month is fixed the first time it is published and never moves again: should an earlier sale later extend the history, the new months simply plot relative to the existing base rather than re-anchoring it — so a level that has already settled a market is never retroactively rescaled. The base value is presentational — 1000 echoes broad market indices such as the FTSE 100 — and does not affect returns.

Rebasing and coverage

Each member series is normalized to a common base before aggregation, and the index is published from the earliest month in which a minimum share of members — 70% for the LiveArt 100 — have data. Earlier, thinly-covered months are withheld rather than shown on a sparse basket.

Rebalancing

Membership and target weights are reconstituted annually. Between reconstitutions the basket is held, so weights drift with performance and are reset to target at each annual rebalance.

Rebalancing preserves continuity: the index level on a reconstitution date is identical immediately before and after the new weights take effect. Target weights are applied by rescaling the implied holdings to the prevailing level, so the rebalance contributes no return of its own and creates no jump in the series — only the weighting of subsequent monthly returns changes. This mirrors the divisor adjustment used in classical price indices when constituents change.

Recompute cadence

Sales are ingested as they are published, and the index is recomputed and published once daily, each day producing a dated value. The series is monthly. Each member’s newest estimate is provisional — thin and prone to revision — so it is eased in rather than allowed to move the index in a single step; the latest month stays current but can revise as those estimates settle. Intraday recomputation is unnecessary because the underlying estimates are monthly and art transactions are episodic rather than continuous.

Revisions

As new auction results are recorded, LiveArt re-estimates each member’s value across its entire history; because the indices aggregate those estimates, both past and present index levels can be restated. This is standard for transaction-driven indices — S&P CoreLogic Case-Shiller and the FHFA House Price Index likewise recompute their full history with each release. Revisions are largest for the most recent periods and stabilize as history seasons. Every published value carries an as-of date and a model version, and prior vintages are retained so any earlier reading is reproducible. The live value is therefore a best estimate, not a final number; its frozen, settlement-grade counterpart is the official close, described below.

Official close

For settlement, LiveArt publishes an official close alongside the live series, so two numbers exist for any date and they can differ by design. The live value revises as described above and is for analysis only. The official close is the value of record. Each trading day a single official close is fixed at a defined UTC cutoff from the data available at that moment — a first print: it is reported immediately and is not seasoned — and the close on the last day of a month is also that month’s close. Once published, an official close is immutable: it never silently changes. A genuine upstream error is corrected only through a separately published, pre-announced, versioned correction; the original value stays on record and a consumer can pin a specific version. Every close carries the methodology and model version plus an identifier (and hash) of the exact input snapshot used, so it can be independently reproduced and verified — which is what makes it usable as a settlement oracle. Settled products — for example on-chain prediction markets — resolve on the official close for a stated date; the live value must never be used for settlement.

11.Validation

Validation is the part most AI vendors quietly skip. Here's the approach.

We train on data through year N and test on year N+1. The sequence: train on 2022, test on 2023. Train on 2023, test on 2024. Train on 2024, test on 2025. This prevents the model from interpolating between known points — a common form of cheating in time-series ML.

Stratified error reporting

Mean absolute error reported by artist tier, price bucket, medium, and region — not just a global headline number.

Calibrated confidence intervals

We check that an ±8% range actually contains roughly 80% of realized prices. Miscalibrated intervals are worse than wide ones.

Versioned models, published changelogs

Each model carries a version. Material changes ship with a changelog noting what shifted and why. Enterprise clients receive segment-level performance reports.

12.Limitations

Every model has limits. We publish ours so consumers of LAE can use it appropriately.

LAE works best for liquid artists. Most accurate for artists with sustained auction activity — typically the top 500–1,000 artists by transaction volume. Below that, confidence ranges widen accordingly.
Emerging artists are hard. Markets in rapid expansion show lagging predictions. Historical data alone is a weak predictor of current value when an artist's market is reshaping in real time.
Primary market is not in scope. Gallery and private sale prices are not in the training data. For living artists where the primary market dominates, LAE reflects auction signal only.
Auction noise is partially filtered. Manipulated sales, guarantees, and buy-ins introduce noise that no model perfectly removes. We filter what we can and surface confidence ranges as a reliability indicator.
Not every work carries an estimate. LAE is published only when the model has enough clean evidence at the right granularity. Some works — group and multi-object lots, thinly traded records, or sales still being processed — have no current estimate. Where that is the case, the product shows the facts it has and omits the estimate rather than guessing.
LAE is a starting point, not a final answer. The model augments — it does not replace the specialist, the appraiser, or the advisor. Confidence ranges exist precisely because a single number is rarely the right answer.

13.Principles

Transparency over mystique. Published methodology. Visible confidence ranges. No black boxes for prestige.
Augment experts, don't replace them. The model supports specialist judgment. It is a starting point, not a verdict.
Purpose-built models, not one giant model. Each question gets the model that fits it. We resist the urge to throw everything into a single architecture.
Continuous validation. The market changes. The model is retrained, re-tested, and reported on a regular cadence.

Our engineering team is available to discuss architecture, validation approach, and model performance in detail. Schedule a session for your quants or research desk.

TALK TO ENGINEERING →