Skip to content

METHODOLOGY

Methodology

How LiveArt Estimate™, the Disposition Score, and the broader model stack are built — transparently, including their limits.

1.Overview

This document describes the methodologies behind LiveArt's quantitative outputs. It begins with the LiveArt Estimate™ (LAE) — our core valuation model — and then covers the analytics built on top of it, including the Disposition Score, price momentum, similarity, and the market indices.

Our stance is the opposite of black-box AI: the methods are published, the limitations are stated, and every estimate carries a confidence range. In a market where any valuation can be scrutinised by a specialist, a client, or a court, we believe transparency is the only responsible approach.

2.LiveArt Estimate (LAE)

LiveArt Estimate™ answers one question — what is this artwork worth right now? — by separating it into three distinct sub-questions, each answered by a different model. The three components compose into a single price with a confidence range.

Base price

COMPONENT 1 · BASE · XGBoost regression

The intrinsic value of an artwork independent of market timing. Trained on 100+ features per artwork: artist, medium, size, period, provenance, and more.

Artwork-specific trend

COMPONENT 2 · ARTWORK · Gradient-boosted decision trees

Captures how specific work categories — by artist, medium, period, size — drift relative to the overall market. Tree-based models capture interaction effects a single regression would miss.

Market trend

COMPONENT 3 · MARKET · Repeat-sales regression

Tracks pure market cycles by analyzing works that have sold more than once. Differencing pairs of sales isolates market movement from artwork-specific drift.

One artwork. One estimate.

For any artwork at any point in time, the final LAE is the sum of these three components. The result is a current estimated price with a confidence range — and, because the components are time-aware, an estimate for any historical point.

3.Why three models

The obvious question is why not use one big model. The answer comes down to three properties of the art market that a single model handles poorly.

Data is sparse per artwork

Most artworks transact once or not at all. A model that needs many observations per artwork has nothing to learn from. Pairs analysis on repeat sales sidesteps this.

Quality is unobserved

Artist significance, retrospective inclusion, condition nuance — these matter to price but aren't in the data. Differencing techniques cancel out these latent factors when comparing repeat sales.

Interactions dominate

A Basquiat painting and a Basquiat drawing don't trend together. A linear model can't capture that. Tree-based models can — but only if the question they're solving is narrow enough.

Signal leak — where one model's output contaminates another's training data — is the most common failure mode in art-market AI. Separating questions enforces clean inputs.

4.Historical estimates

Because LAE is built on a time-aware architecture — the three components each carry time information — the model can produce an estimated price for any artwork at any point in its history. That makes portfolio analytics possible.

  • PRO FORMA RETURNS Returns on hypothetical holding periods. Pick any two dates, get a return — at the artwork, artist, or portfolio level.
  • INDEX BENCHMARKS Compare against LiveArt indices, blue-chip cohorts, or traditional benchmarks (S&P 500, bonds, gold). Same currency, same period.
  • PORTFOLIO ANALYTICS Sharpe ratios, drawdowns, correlation matrices, volatility — derived from continuous price series, not just confirmed sales.

Compound Annual Return (CAR)

Where we report year-by-year returns — such as an artist’s Annual Returns — we use the Compound Annual Return (CAR): the annualized, compounded rate of return realized across repeat sales of the same work. CAR is the same calculation as the compound annual growth rate (CAGR); we say “return” because it is measured from actual buy–sell pairs. Because it compounds rather than averaging each year’s change, it neither overstates nor understates performance across volatile years.

5.Disposition Score

The Disposition Score turns market signals into an underwriter-facing read — accumulate, hold, sell, or watch — with a composite 0–100 score, a market-strength gauge, a confidence level, and a list of plain-language reasons. It is deliberately deterministic and rules-based: every threshold is published below, so any output can be reconstructed from its inputs. It is decision support, not investment advice.

Inputs

For an artist — or, for a held work, that work's artist cohort — the score reads a five-year window of auction activity together with the artist's trailing momentum: sell-through rate, buy-in rate, the split of lots selling above / within / below estimate, the sample size (lots offered), and 12-month price momentum. At the holding level it also reads compound annual return and the position's weight in the portfolio. Missing inputs lower the confidence rather than breaking the score.

Market strength (0–100)

The demand-and-liquidity axis is a weighted blend of whatever signals are present, re-normalised by the weights that actually fire: sell-through (weight 0.5, mapped 50%→90%), share sold above estimate (0.4, mapped 20%→60%), a below-estimate penalty (0.2), and a buy-in penalty (0.3, mapped 10%→40%). The question it answers: can this work be sold, and is demand beating the trade's own pricing?

Trend

Twelve-month momentum sets the trend: rising above +5%, falling below −5%, otherwise flat. A separate “peaking” flag fires above +25% — the recent-run-up-equals-correction-risk signal from the art-finance literature.

The signal

Market strength is crossed with trend, deliberately, so that “sell” and “watch” are not the same recommendation:

  • Weak market (below 45) → Watch — illiquid; exit is hard, so monitor rather than act.
  • Strong market (65+), peaking or falling → Sell — realise value while the work is still liquid.
  • Strong market, rising → Accumulate.
  • Strong market, flat → Hold.
  • Moderate market (45–64): falling → Watch (don't sell into weakness); peaking → Sell; otherwise → Hold.

The crucial distinction: a strong but falling market says Sell (exit while you can); a moderate, falling market says Watch (selling now would be into weakness). That is why a deep, liquid name with flat momentum reads Hold, while a thinner one with falling momentum reads Watch.

Composite score & confidence

The 0–100 composite used to rank holdings is an equal blend of market strength and a momentum-derived trend score (50 at flat, 100 at +20%, 0 at −20%), less a penalty for positions above 25% of portfolio value. Confidence is High when market data and momentum are both present over at least 30 lots, Medium above 10 lots, otherwise Low.

Reasons & auditability

Every rule that fires emits a structured reason — a code, a severity, and the value that triggered it — rendered in plain language and sorted so cautions appear first. Because the thresholds are fixed and published, the “why” is always reconstructable from the inputs; nothing is generated by a language model. An optional AI-written narrative may be layered on top in future, but never in place of the rules.

Limitations

The thresholds are a defensible first pass, not yet back-tested against realised sale outcomes. The artist-level read uses the artist's whole market; the per-work and per-holding versions narrow to the work's cohort and add return and concentration. The Disposition Score is a market read for professionals, not personalised investment advice.

6.Price momentum

Repeat-sales-filtered 12-month signal at the artist or category level.

Real-time structured signals from auction calendars, results, and corrections.

7.Similarity & embeddings

64-dimensional vectors enabling similarity comparison and clustering across 350K+ artists.

Comparable-artwork retrieval based on visual and metadata features — the workhorse behind cataloguing and search.

Cataloguing workflows: artist attribution, medium detection, edition matching from photos.

8.Indices

LiveArt indices — the artist index, blue-chip cohorts, and the global market index used for benchmarking — are built from the same repeat-sales and time-aware estimate machinery described above, aggregated to the cohort level and rebased to a common start.

A full, standalone methodology for each index family — construction, weighting, rebasing, and rebalancing rules — is being written and will be published in this document.

9.Validation

Validation is the part most AI vendors quietly skip. Here's the approach.

We train on data through year N and test on year N+1. The sequence: train on 2022, test on 2023. Train on 2023, test on 2024. Train on 2024, test on 2025. This prevents the model from interpolating between known points — a common form of cheating in time-series ML.

Stratified error reporting

Mean absolute error reported by artist tier, price bucket, medium, and region — not just a global headline number.

Calibrated confidence intervals

We check that an ±8% range actually contains roughly 80% of realized prices. Miscalibrated intervals are worse than wide ones.

Versioned models, published changelogs

Each model carries a version. Material changes ship with a changelog noting what shifted and why. Enterprise clients receive segment-level performance reports.

10.Limitations

Every model has limits. We publish ours so consumers of LAE can use it appropriately.

  • LAE works best for liquid artists. Most accurate for artists with sustained auction activity — typically the top 500–1,000 artists by transaction volume. Below that, confidence ranges widen accordingly.
  • Emerging artists are hard. Markets in rapid expansion show lagging predictions. Historical data alone is a weak predictor of current value when an artist's market is reshaping in real time.
  • Primary market is not in scope. Gallery and private sale prices are not in the training data. For living artists where the primary market dominates, LAE reflects auction signal only.
  • Auction noise is partially filtered. Manipulated sales, guarantees, and buy-ins introduce noise that no model perfectly removes. We filter what we can and surface confidence ranges as a reliability indicator.
  • LAE is a starting point, not a final answer. The model augments — it does not replace the specialist, the appraiser, or the advisor. Confidence ranges exist precisely because a single number is rarely the right answer.

11.Principles

  • Transparency over mystique. Published methodology. Visible confidence ranges. No black boxes for prestige.
  • Augment experts, don't replace them. The model supports specialist judgment. It is a starting point, not a verdict.
  • Purpose-built models, not one giant model. Each question gets the model that fits it. We resist the urge to throw everything into a single architecture.
  • Continuous validation. The market changes. The model is retrained, re-tested, and reported on a regular cadence.

Our engineering team is available to discuss architecture, validation approach, and model performance in detail. Schedule a session for your quants or research desk.

TALK TO ENGINEERING →