1.Overview
This document describes the methodologies behind LiveArt's quantitative and qualitative outputs. It begins with the LiveArt Estimate™ (LAE) — our core valuation model — and then covers the analytics built on top of it: the Historical View, the forward-looking Forward View, the Outlook that combines the two, price momentum, similarity, and the market indices.
Our stance is the opposite of black-box AI: the methods are published, the limitations are stated, and every estimate carries a confidence range. In a market where any valuation can be scrutinised by a specialist, a client, or a court, we believe transparency is the only responsible approach.
2.LiveArt Estimate (LAE)
LiveArt Estimate™ answers one question — what is this artwork worth right now? — by separating it into three distinct sub-questions, each answered by a different model. The three components compose into a single price with a confidence range.
Base price
COMPONENT 1 · BASE · XGBoost regression
The intrinsic value of an artwork independent of market timing. Trained on 100+ features per artwork: artist, medium, size, period, provenance, and more.
Artwork-specific trend
COMPONENT 2 · ARTWORK · Gradient-boosted decision trees
Captures how specific work categories — by artist, medium, period, size — drift relative to the overall market. Tree-based models capture interaction effects a single regression would miss.
Market trend
COMPONENT 3 · MARKET · Repeat-sales regression
Tracks pure market cycles by analyzing works that have sold more than once. Differencing pairs of sales isolates market movement from artwork-specific drift.
One artwork. One estimate.
For any artwork at any point in time, the final LAE is the sum of these three components. The result is a current estimated price with a confidence range — and, because the components are time-aware, an estimate for any historical point.
3.Why three models
The obvious question is why not use one big model. The answer comes down to three properties of the art market that a single model handles poorly.
Data is sparse per artwork
Most artworks transact once or not at all. A model that needs many observations per artwork has nothing to learn from. Pairs analysis on repeat sales sidesteps this.
Quality is unobserved
Artist significance, retrospective inclusion, condition nuance — these matter to price but aren't in the data. Differencing techniques cancel out these latent factors when comparing repeat sales.
Interactions dominate
A Basquiat painting and a Basquiat drawing don't trend together. A linear model can't capture that. Tree-based models can — but only if the question they're solving is narrow enough.
Signal leak — where one model's output contaminates another's training data — is the most common failure mode in art-market AI. Separating questions enforces clean inputs.
4.Historical estimates
Because LAE is built on a time-aware architecture — the three components each carry time information — the model can produce an estimated price for any artwork at any point in its history. That makes portfolio analytics possible.
- PRO FORMA RETURNS — Returns on hypothetical holding periods. Pick any two dates, get a return — at the artwork, artist, or portfolio level.
- INDEX BENCHMARKS — Compare against LiveArt indices, blue-chip cohorts, or traditional benchmarks (S&P 500, bonds, gold). Same currency, same period.
- PORTFOLIO ANALYTICS — Sharpe ratios, drawdowns, correlation matrices, volatility — derived from continuous price series, not just confirmed sales.
Compound Annual Return (CAR)
Where we report year-by-year returns — such as an artist’s Annual Returns — we use the Compound Annual Return (CAR): the annualized, compounded rate of return realized across repeat sales of the same work. CAR is the same calculation as the compound annual growth rate (CAGR); we say “return” because it is measured from actual buy–sell pairs. Because it compounds rather than averaging each year’s change, it neither overstates nor understates performance across volatile years.
5.Historical View
The Historical View turns market signals into an underwriter-facing read — accumulate, hold, sell, or watch — with a composite 0–100 score, a market-strength gauge, a confidence level, and a list of plain-language reasons. It is deliberately deterministic and rules-based: every threshold is published below, so any output can be reconstructed from its inputs. It is decision support, not investment advice.
Inputs
For an artist — or, for a held work, that work's artist cohort — the score reads a five-year window of auction activity together with the artist's trailing momentum: sell-through rate, buy-in rate, the split of lots selling above / within / below estimate, the sample size (lots offered), and 12-month price momentum. At the holding level it also reads compound annual return and the position's weight in the portfolio. Missing inputs lower the confidence rather than breaking the score.
Market strength (0–100)
The demand-and-liquidity axis is a weighted blend of whatever signals are present, re-normalised by the weights that actually fire: sell-through (weight 0.5, mapped 50%→90%), share sold above estimate (0.4, mapped 20%→60%), a below-estimate penalty (0.2), and a buy-in penalty (0.3, mapped 10%→40%). The question it answers: can this work be sold, and is demand beating the trade's own pricing?
Trend
Twelve-month momentum sets the trend: rising above +5%, falling below −5%, otherwise flat. A separate “peaking” flag fires above +25% — the recent-run-up-equals-correction-risk signal from the art-finance literature.
The signal
Market strength is crossed with trend, deliberately, so that “sell” and “watch” are not the same recommendation:
- Weak market (below 45) → Watch — illiquid; exit is hard, so monitor rather than act.
- Strong market (65+), peaking or falling → Sell — realise value while the work is still liquid.
- Strong market, rising → Accumulate.
- Strong market, flat → Hold.
- Moderate market (45–64): falling → Watch (don't sell into weakness); peaking → Sell; otherwise → Hold.
The crucial distinction: a strong but falling market says Sell (exit while you can); a moderate, falling market says Watch (selling now would be into weakness). That is why a deep, liquid name with flat momentum reads Hold, while a thinner one with falling momentum reads Watch.
Composite score & confidence
The 0–100 composite used to rank holdings is an equal blend of market strength and a momentum-derived trend score (50 at flat, 100 at +20%, 0 at −20%), less a penalty for positions above 25% of portfolio value. Confidence is High when market data and momentum are both present over at least 30 lots, Medium above 10 lots, otherwise Low.
Reasons & auditability
Every rule that fires emits a structured reason — a code, a severity, and the value that triggered it — rendered in plain language and sorted so cautions appear first. Because the thresholds are fixed and published, the “why” is always reconstructable from the inputs; nothing is generated by a language model. An optional AI-written narrative may be layered on top in future, but never in place of the rules.
Limitations
The thresholds are a defensible first pass, not yet back-tested against realised sale outcomes. The artist-level read uses the artist's whole market; the per-work and per-holding versions narrow to the work's cohort and add return and concentration. The Historical View is a market read for professionals, not personalised investment advice.
6.Forward View
The Historical View reads the past — completed auction sales. The Forward View is its forward-looking complement: a structured, cited, confidence-scored reading of the forces that move an artist's market before they reach the sale room. It is organised as a bull-case versus bear-case ledger — the art-market equivalent of an equity research note — and it is advisory: it informs the top-line read without overriding the deterministic quant. Every signal carries its sources, so any claim can be checked.
The signal — the unit of analysis
We do not publish opinions in prose. The unit is a structured signal: a single sourced fact about an artist's market, tagged with its direction (bull or bear), how material it is to price, the horizon over which it plays out, a confidence level, how much of it is already reflected in past sale prices, how fresh the evidence is, and a one-line rationale linking the fact to value. A fact that does not map to a direction, a magnitude, and a horizon is treated as context, not signal. The result is a ledger in which every line can be audited on its own.
Two kinds of signal: directional and structural
Signals fall into two families that are scored separately and never mixed. Directional signals move the bull/bear balance — the view on where price is heading. Structural signals describe price-discovery quality — whether the marked price can be trusted at all — and they only ever lower it; a favourable market structure is never counted as a reason to be bullish. The separation is deliberate. Illiquidity, concentrated ownership, or a contested authority are reasons to trust an observed price less, not reasons to be bearish on the artist — and conflating the two is the most common way an art-market model misleads.
Directional axes — the price-direction view
Twelve axes carry the directional view. Each is bidirectional: depending on the evidence, the same axis can post a bull or a bear signal.
- Institutional validation — Major museum exhibitions, acquisitions, and biennial selection — the slow, durable signal that an artist is entering the canon.
- Gallery representation — The tier of gallery placing the work and any recent upgrade or downgrade in representation — a forward read on primary-market support, for living artists and active estates.
- Supply & estate — Estate discipline, foundation oversight, and catalogue raisonné status, weighed against supply overhang or estate dumping.
- Authentication & title — Whether a work is authenticated, catalogued, and free of attribution, title, or restitution disputes.
- Narrative momentum — Shifts in critical and media attention — rediscovery and of-the-moment heat versus fading coverage.
- Speculation risk — Signs of overheating, flipping, and unsustainable run-ups that historically precede corrections.
- Provenance — Distinguished, single-owner, or celebrity ownership history versus thin, murky, or gapped provenance.
- Primary–secondary gap — The relationship between primary-market asking prices and secondary-market results — a read on genuine underlying demand.
- Critical & scholarly standing — Monographs, serious scholarship, and canonical inclusion — the academic foundation beneath a market.
- Awards & honours — Major prizes, national-pavilion selection, and formal recognition.
- Reputational risk — Scandal, legal exposure, or controversy capable of moving a market sharply.
- Zeitgeist & cohort — Whether the artist sits in a category with structural tailwinds or headwinds — rediscovered women, regional cohorts, out-of-favour movements.
Structural axes — price-discovery quality
Eighteen axes describe ways the observed price itself becomes less trustworthy. They are impairments only — each can lower price-discovery quality but never raise it. They were identified one market archetype at a time, and a candidate is adopted only when it is a genuinely distinct mechanism that generalises beyond a single artist.
- Ownership concentration — A few holders control and defend supply, so observed prices may reflect a managed float rather than a broad market.
- Authentication regime — A contested, absent, or punitive authentication authority where the certifier's identity is itself price-determining.
- Market maturity — A young, thin, or volatile secondary market formed inside a single speculative window, where comparables are unreliable.
- Oeuvre heterogeneity — Output that splits into distinct media or series sub-markets, so an artist-level average misleads on any single work.
- Liquidity float — Canonical works are largely museum-locked, leaving a near-zero private float and decades-stale comparables.
- Geographic concentration — Demand and sale venues concentrated in one region, adding currency, regulatory, and regional-cycle risk and a narrow bidder pool.
- Settlement reliability — Reported hammer prices that may not correspond to settled, paid transactions, so headline results overstate liquid value.
- Brand & commercial dependence — A price level underwritten by an active commercial brand or hype engine rather than autonomous demand.
- Ideological provenance — Documented politics or history that permanently caps the buyer and display pool — a one-directional ceiling.
- Conservation fragility — Perishable, organic, or kinetic works that decay and raise a replacement-identity question when parts are remade.
- Attribution stability — An accepted corpus that itself shifts as scholarship and connoisseurship evolve, moving status and value by orders of magnitude.
- Mortality overhang — A known-ill or elderly living artist's supply-discontinuity uncertainty — distinct from the death event itself.
- Site-specificity & title — Works fused to non-owned property, where title is contested and value can be destroyed on removal.
- Signature-motif dependence — A market resting on one endlessly-repeated icon, raising fungibility and self-dilution risk.
- Authentication-destruction risk — An authority that can physically seize or destroy a rejected work, not merely withhold a certificate.
- Cast & edition authenticity — In reproducible cast or edition media, multiple legitimate originals coexist on a lifetime-to-posthumous hierarchy, so which cast dominates value.
- Diminished-capacity authorship — Genuine doubt about the artist's agency at the time of making, splitting an authentic-by-hand body into contested tiers.
- Digital-custody dependence — For digital-native work, value resting on off-the-artwork substrates — token binding, key custody, hosting, and file permanence.
First-party market signals
Alongside the AI-gathered forward signals, the bull/bear case surfaces LiveArt's own market-structure indicators — the share of lots sold above or below estimate, the bought-in rate, sell-through, 12-month momentum and long-run CAGR — drawn directly from our settled-sales record over a five-year window. They appear as primary-source cards, badged "LiveArt market data", interleaved into and leading each bull/bear column. They are evidence only: because these same metrics already feed the quantitative disposition that anchors the Outlook, they are deliberately excluded from the Forward View tilt to avoid double-counting.
Price-discovery quality
The structural impairments compound: each independent impairment multiplies the others down, so several mild impairments together can pull an artist's price-discovery quality well below half even when no single one is disqualifying. This one reading, expressed as a percentage, becomes the Structural Integrity dimension of the top-line below. A low reading is itself the flag — it tells you to weight the observed price, and any trend drawn from it, accordingly.
Anti-double-counting, confidence, and honesty
Two disciplines keep the layer additive and honest. First, a signal already absorbed into past sale prices is discounted, so the layer contributes only what the backward-looking quant has not already priced in — forward, not-yet-realised catalysts carry the most weight. Second, low-confidence signals are shown but excluded from scoring, and a catalyst whose date has passed is dropped from the forward view until its outcome is confirmed. Monetary figures appear only when sourced; unproven claims are explicitly hedged; market manipulation is never asserted as fact.
7.Outlook
The Outlook is the single top-line read. It fuses the backward-looking Historical View with the forward-looking Forward View into three independent dimensions — Direction, Conviction, and Structural Integrity — plus a one-line plain-language summary. The three are reported separately on purpose: a call can be clear in direction yet low in conviction, or high in conviction yet hard to act on. Collapsing them into one number would hide exactly the distinctions a professional needs.
Direction — Accumulate · Hold · Reduce · Monitor
Direction blends the quant score with the net Forward View tilt into a single reading, then bands it:
- Accumulate — the weight of evidence favours adding to a position.
- Hold — the read is balanced or stable; no action is indicated.
- Reduce — the weight of evidence favours trimming.
- Monitor — conviction is too low to take a position; shown with the direction it leans, this is the honest “not yet known.”
These four labels are the fused top-line. They take the Historical View's own accumulate / hold / sell / watch as one of their inputs, alongside the Forward View tilt — so the top-line can lean against the raw quant signal when the forward evidence warrants it. On a single artwork the same Direction is expressed in the single-asset register — Buy / Hold / Sell — since you can't accumulate or reduce one object.
Conviction — High · Moderate · Low
Conviction is how much trust to place in the Direction. It blends two things: Evidence — how deep and reliable the underlying market data is — and Agreement — how closely the backward-looking and forward-looking reads point the same way. We combine them so a shortfall in either tempers the result, and we surface disagreement rather than averaging it away: when the past and the forward view conflict, conviction falls even if each read is individually clear. Low conviction is what turns a Direction into a Monitor.
Structural Integrity — High · Moderate · Low
Structural Integrity asks a different question: can you actually act on the call? It is the price-discovery-quality reading from the qualitative layer, surfaced as its own dimension and deliberately kept out of Conviction. A museum-locked masterpiece can be a perfectly high-conviction Accumulate that is nonetheless hard to execute; folding executability into conviction would wrongly mute the view. When structural integrity is impaired, the top-line names the reason in plain language — museum-locked float, concentrated ownership, contested authentication, fragmented sub-markets, and the like.
The summary line and divergence
The three dimensions resolve into one sentence — for example, “Hold · Moderate conviction · limited structural integrity (museum-locked float).” Where the forward qualitative read disagrees with the backward quant read, the divergence is shown explicitly as the headline insight, because a forward signal that contradicts the price chart is often the most decision-useful output the system produces.
When there is no qualitative ledger
Most artists do not yet carry a qualitative ledger. For them the top-line reads off the Historical View alone — no qualitative adjustment, full structural integrity assumed, conviction set by the strength of the market data — with no false precision added. The qualitative layer only ever sharpens a read where genuine, sourced signals exist.
Like the Historical View, the Outlook is decision support for professionals — a structured market read, not personalised investment advice or a price target.
8.Price momentum
Repeat-sales-filtered 12-month signal at the artist or category level.
Real-time structured signals from auction calendars, results, and corrections.
9.Similarity & embeddings
64-dimensional vectors enabling similarity comparison and clustering across 350K+ artists.
Comparable-artwork retrieval based on visual and metadata features — the workhorse behind cataloguing and search.
Cataloguing workflows: artist attribution, medium detection, edition matching from photos.
10.Indices
LiveArt indices — the artist index, blue-chip cohorts, and the global market index used for benchmarking — are built from the same repeat-sales and time-aware estimate machinery described above, aggregated to the cohort level and rebased to a common start.
A full, standalone methodology for each index family — construction, weighting, rebasing, and rebalancing rules — is being written and will be published in this document.
11.Validation
Validation is the part most AI vendors quietly skip. Here's the approach.
We train on data through year N and test on year N+1. The sequence: train on 2022, test on 2023. Train on 2023, test on 2024. Train on 2024, test on 2025. This prevents the model from interpolating between known points — a common form of cheating in time-series ML.
Stratified error reporting
Mean absolute error reported by artist tier, price bucket, medium, and region — not just a global headline number.
Calibrated confidence intervals
We check that an ±8% range actually contains roughly 80% of realized prices. Miscalibrated intervals are worse than wide ones.
Versioned models, published changelogs
Each model carries a version. Material changes ship with a changelog noting what shifted and why. Enterprise clients receive segment-level performance reports.
12.Limitations
Every model has limits. We publish ours so consumers of LAE can use it appropriately.
- LAE works best for liquid artists. Most accurate for artists with sustained auction activity — typically the top 500–1,000 artists by transaction volume. Below that, confidence ranges widen accordingly.
- Emerging artists are hard. Markets in rapid expansion show lagging predictions. Historical data alone is a weak predictor of current value when an artist's market is reshaping in real time.
- Primary market is not in scope. Gallery and private sale prices are not in the training data. For living artists where the primary market dominates, LAE reflects auction signal only.
- Auction noise is partially filtered. Manipulated sales, guarantees, and buy-ins introduce noise that no model perfectly removes. We filter what we can and surface confidence ranges as a reliability indicator.
- Not every work carries an estimate. LAE is published only when the model has enough clean evidence at the right granularity. Some works — group and multi-object lots, thinly traded records, or sales still being processed — have no current estimate. Where that is the case, the product shows the facts it has and omits the estimate rather than guessing.
- LAE is a starting point, not a final answer. The model augments — it does not replace the specialist, the appraiser, or the advisor. Confidence ranges exist precisely because a single number is rarely the right answer.
13.Principles
- Transparency over mystique. Published methodology. Visible confidence ranges. No black boxes for prestige.
- Augment experts, don't replace them. The model supports specialist judgment. It is a starting point, not a verdict.
- Purpose-built models, not one giant model. Each question gets the model that fits it. We resist the urge to throw everything into a single architecture.
- Continuous validation. The market changes. The model is retrained, re-tested, and reported on a regular cadence.
Our engineering team is available to discuss architecture, validation approach, and model performance in detail. Schedule a session for your quants or research desk.
TALK TO ENGINEERING →