Is Göbekli Tepe's Pillar 43 really a 13,000-year-old star map?

An independent statistical decomposition of the claim that animal carvings at Göbekli Tepe encode constellations and the date of the Younger Dryas catastrophe.

2026-05-24

01 / Question

In 2017, physicist Martin Sweatman published a paper arguing that animal carvings on Pillar 43 at Göbekli Tepe represent star constellations encoding a precise date — 10,950 BC — corresponding to the Younger Dryas catastrophe. His revised statistical claim stands at 1 in 1.4 million. If true, it would push back systematic astronomy by 7,000 years and provide evidence for a lost pre-ice-age civilization.

Does the statistical evidence actually support that interpretation?

02 / Short version

The reported significance of 1 in 1.4 million substantially overstates the evidence. The date component is not rare — 43% of the 20,000–5,000 BC window lies within 250 years of a notable catastrophe, and 31% of random constellation assignments produce a date match. The visual similarity between carvings and constellations is genuinely unusual (1 in 391,000), but this signal is overwhelmingly carried by three of six mappings (1 in 85,184 vs 1 in 187 for the remaining three). Those rankings were self-scored by the hypothesis proponent and have never been independently validated. The overall significance varies by two orders of magnitude across defensible analyst choices.

03 / Main article

Göbekli Tepe is a monumental stone site in southeastern Turkey, built around 9,500 BC. That's about 6,000 years before Stonehenge and 7,000 years before the Great Pyramid. It consists of massive T-shaped pillars arranged in circles, many of them carved with elaborate animal reliefs — scorpions, foxes, vultures, snakes, boars. Nobody disputes its age or its importance. It's one of the oldest known examples of monumental architecture on Earth.

Göbekli Tepe, southeastern Turkey. Circular enclosures with T-shaped pillars, built around 9,500 BC — roughly 7,000 years before the Great Pyramid. Credit: Wikimedia Commons

Pillar 43, known as the "Vulture Stone," is the most densely decorated pillar at the site. Its broadside is covered with animal carvings: a large vulture with a disc balanced on its wing, a scorpion, a wolf-like animal, a tall bird grasping a snake, and several others.

Pillar 43, the "Vulture Stone." The most densely carved pillar at Göbekli Tepe. A vulture with a disc on its wing dominates the center, with a scorpion below and other animals surrounding it.

The archaeological team that excavated the site interprets these as funerary art — scenes related to death rituals and excarnation (the practice of exposing the dead to vultures). The disc might be a severed head. The animals might be symbolic representations tied to burial customs.

In 2017, a physicist named Martin Sweatman proposed something radically different. He argued that these animals aren't just animals. They're constellations. And together, they encode a precise date: 10,950 BC.

That date corresponds to the Younger Dryas — a sudden, extreme climate collapse that plunged the Earth back into near-glacial conditions for over a thousand years. Temperatures dropped several degrees within decades. Megafauna went extinct across multiple continents. Human populations were severely disrupted. Some researchers believe it was triggered by a comet or asteroid impact. It's one of the most dramatic environmental events in human prehistory.

GISP2 ice core data from Greenland. The red line shows the Younger Dryas temperature plunge around 12,900 years ago — a sudden drop of several degrees that lasted over a thousand years. Adapted from Haynes.

If Sweatman is right, the implications are enormous. It would mean that people living around 9,500 BC carved a memorial to an event that had happened roughly 1,400 years before their time — an event their ancestors had witnessed and passed down through generations. And it would mean those ancestors, living 13,000 years ago, were sophisticated enough to track the stars, recognize constellations, understand that the sky shifts over millennia, and encode that knowledge in a form that survived. That would push back the timeline of systematic astronomy by roughly 7,000 years before the Egyptians or Mesopotamians.

This is why the claim has attracted so much attention beyond academia. It feeds directly into the idea — popularized by Graham Hancock — that an advanced civilization existed before the last ice age and was wiped out by the Younger Dryas catastrophe, with only fragmentary memories surviving in ancient monuments. If the Pillar 43 reading is correct, Göbekli Tepe becomes evidence for that thesis.

His statistical significance claim: 1 in 1.4 million. In other words, the probability that these animal carvings match constellations by pure coincidence is, according to him, so vanishingly small that the astronomical interpretation is essentially proven.

I built an independent statistical pipeline to test that number. 13 experiments, ~1,500 lines of code, every result reproducible from a single script. The code and data are public.

Here's what I found.

How the claim works

The constellations are always in the sky, every year, every era. Scorpius, Sagittarius, Orion — they don't come and go. What changes over thousands of years is where the sun sits on specific calendar days.

Right now, on the summer solstice (the longest day of the year), the sun is in the constellation Gemini. But this slowly shifts. About 2,000 years ago, the summer solstice sun was in Cancer. 6,000 years ago it was in Leo. This drift is called precession, and it completes a full cycle every 25,772 years.

This means each constellation gets a turn as the "summer solstice constellation," and each turn corresponds to a specific era. It's like a clock that takes 26,000 years to go around once. If you know which constellation the sun was in on the summer solstice, you know roughly what millennium you're in.

The precession clock. The sun's position on the summer solstice slowly drifts through the constellations over a 25,772-year cycle. Today it's in Gemini. In 10,950 BC it was in Sagittarius — Sweatman's claimed timestamp.

Sweatman says: the vulture carving represents Sagittarius. The disc on its wing represents the sun. Putting the sun inside Sagittarius on the summer solstice — that only happens around 10,950 BC. That's the timestamp.

The other animals — scorpion, wolf, bird-with-snake, ibex, frog — are additional constellations that fill out the scene. He compared each carving to all 44 constellations visible from Göbekli Tepe's latitude and ranked them by visual resemblance. A rank of 1 means that constellation is the single best visual match out of all 44 options. A rank of 44 would mean it's the worst.

His results: the scorpion carving most closely resembles Scorpius — rank 1 out of 44. The wolf carving most closely resembles Lupus (Latin for wolf) — rank 1 out of 44. The bird-with-snake most closely resembles Ophiuchus (a figure grasping a serpent) — rank 1 out of 44. Three perfect matches.

The other three are decent but not perfect: the bending bird matches Pisces at rank 4, the ibex matches Gemini at rank 5, the frog/bear matches Virgo at rank 6.

From all six rankings together, he calculates the overall probability of getting matches this good by random chance and arrives at 1 in 1.4 million. His conclusion: this can't be a coincidence; the carvings must represent constellations encoding the date of the Younger Dryas.

Problem 1: The date isn't special

The period from 20,000 to 5,000 BC was catastrophically eventful. Ice ages ending. Massive floods. Supervolcanic eruptions. Abrupt climate reversals. Species extinctions. The Younger Dryas is the most famous of these events, but it's far from the only one. There's the Older Dryas (a similar cold snap about a thousand years earlier), the Bonneville megaflood, the Heinrich Event 1 iceberg collapse, the Bølling-Allerød rapid warming, and at least a dozen more.

I compiled a catalog of 14 major event clusters from the published paleoclimatic record and asked a simple question: how much of this era is "near" a notable disaster?

The answer: 43%. Nearly half of the entire 15,000-year window is within 250 years of at least one catastrophe. Pick a random date from this era and you have close to a coin-flip chance of landing near something terrible.

43% of the period from 20,000 to 5,000 BC falls within 250 years of a major catastrophe. Pick a random date from this era and you're nearly as likely to land near a disaster as not. Each red tick marks one of 14 cataloged event clusters.

Then I tested all 52 possible constellation-and-cardinal-day combinations (13 ecliptic constellations × 4 cardinal days — spring equinox, summer solstice, autumn equinox, winter solstice). Each combination produces a different precession date. If the vulture were Scorpius instead of Sagittarius, you'd get ~15,500 BC — near the Bonneville Flood. If it were Leo, you'd get ~9,175 BC — near a major cold event. If it were Virgo, ~12,035 BC — near the Older Dryas. 16 out of 52 combinations — that's 31% — land near a cataloged event.

Sweatman's Sagittarius at summer solstice is just one of many that work.

Here's the twist: removing the Younger Dryas itself from the catalog doesn't change the result at all. That's because the Sagittarius center date (11,835 BC) is actually closer to the Older Dryas than to the YD onset. Sweatman's chosen timestamp doesn't even point cleanly at the event he claims it commemorates.

Landing on a date near a catastrophe during the deglacial era is like landing near a restaurant in Manhattan. It doesn't mean someone was aiming for one.

Problem 2: The visual similarity is real — but three carvings do all the heavy lifting

Now here's where it gets interesting, because the visual matches aren't fake.

Remember, Sweatman ranked each of his 6 animal carvings against all 44 visible constellations. If the carvings had nothing to do with constellations, you'd expect random ranks — something like 20, 30, 15, 38, 7, 42. Add those up and you get a sum around 135. That's what chance looks like.

Sweatman's ranks are [1, 1, 1, 4, 5, 6]. His sum is 18. That's wildly below what chance predicts.

I computed the exact probability of getting a sum this low or lower by pure chance: 1 in 391,000. That's genuinely unusual.

But then I did something Sweatman never did. I split his six matches into two groups and ran the numbers separately.

The three best matches — scorpion → Scorpius, wolf → Lupus, bird-with-snake → Ophiuchus — all ranked 1st. Together they produce a probability of 1 in 85,184. Very strong. You would not expect three random rankings to all come out on top like that.

The three weakest — bending bird → Pisces (rank 4), ibex → Gemini (rank 5), frog/bear → Virgo (rank 6) — together produce a probability of 1 in 187. Still below conventional significance thresholds, but far weaker.

The decomposition. Three of Sweatman's six matches (scorpion, wolf, bird-with-snake) carry almost all the statistical weight at 1 in 85,184. The other three contribute 1 in 187 — still significant, but 450 times weaker. His headline number bundles both groups as if they're equal.

So the impressive overall number (1 in 391,000) is mostly coming from just three carvings. To see how much each one matters, I tried removing them one at a time. Remove the scorpion and the significance drops by a factor of 14.7. Same for the wolf or the bird-with-snake. Remove the frog/bear instead and it barely moves — a factor of 1.9.

What this means: Sweatman presents six constellation matches as though they're all contributing to his case. In reality, three of them are doing almost all the work. The other three still contribute signal, but far less statistical weight. His headline number looks impressive because it bundles a very strong signal from three carvings with a much weaker signal from three others, and presents the total as if all six are equally compelling.

Problem 3: The rankings are self-scored

Sweatman ranked how well each animal matches each constellation himself. All 264 pairs (6 animals × 44 constellations). He proposed the hypothesis, and then he generated the data that his own statistical test evaluates.

This isn't an accusation of dishonesty. Confirmation bias in visual pattern matching is well documented. When you know what you're looking for, you tend to find it. And here, the person deciding "how much does this scorpion look like Scorpius?" is the same person whose theory depends on the answer being "a lot." That's a setup for inflated results, even with the best intentions.

How sensitive is the result to those specific rankings? I tested it with a simple thought experiment. Imagine a slightly more skeptical scorer who looks at the same carvings and says: "These are good matches, but maybe not quite rank 1 — more like rank 2." Just one notch less generous across the board.

That single-notch shift drops the significance from 1 in 391,000 to 1 in 54,000. Seven times weaker, from a minimal change. Push the top three down to rank 3 ("good match, but not the best") and the significance drops further to 1 in 54,000 again, but now the decomposition shifts: the top three are no longer dramatically stronger than the bottom three.

That weaker result would still be statistically notable on its own. The issue is not that the signal disappears — it's that its strength is highly sensitive to subjective scoring decisions, and those decisions were made by the person whose theory depends on them.

The entire statistical case balances on the knife's edge of three subjective judgments, and nobody outside Sweatman's group has ever independently evaluated them.

Problem 4: You can't separate the match from the name

This one is subtle but possibly the deepest issue.

Why does scorpion → Scorpius work so well as a visual match? Think about it. The constellation is called Scorpius because the ancient Greeks thought those stars looked like a scorpion. Wolf → Lupus works because Lupus is literally the Latin word for wolf. Ophiuchus means "serpent-bearer" — a figure grasping a snake.

The constellation names carry the animal identity built in.

So when Sweatman holds up a scorpion carving and says "look, it matches Scorpius!" — is that because Neolithic people independently saw a scorpion in that patch of sky 13,000 years ago? Or is it because the Greeks later named those same stars "the scorpion," and now any scorpion carving from any culture in any era will automatically match?

There are scorpion carvings all over the ancient world — Egypt, Mesopotamia, Mesoamerica. None of them are assumed to represent the constellation Scorpius.

Any ancient culture that carved common animals — scorpions, wolves, birds, bears — will produce matches against a constellation set that was named after the same common animals. The match might be real evidence of a shared astronomical tradition stretching back into deep prehistory. Or it might be an inevitable collision between two naming systems that both drew from the same pool of familiar creatures.

It's possible in principle that the Greek constellation names preserve a much older tradition — that Scorpius was already "the scorpion" long before Greece. But that would need independent evidence: recurring animal-star associations across unrelated cultures, or matches that aren't trivially explained by the animal's name. The later Greek and modern constellation framework cannot be assumed to preserve a deep-time tradition without that evidence.

This isn't hypothetical. At least one other independent analyst has applied the same approach to the same pillar and arrived at a completely different mapping — different constellation assignments, different animal names, and a date of 14,800 BCE instead of Sweatman's 10,950 BC. Both presentations look compelling when laid out visually. Both produce dates near major catastrophes. They can't both be right — but the fact that the same carvings yield multiple convincing-looking readings, separated by nearly 4,000 years, tells you how much flexibility the method allows.

The current statistical framework cannot tell the difference. And that's the problem.

How researcher choices change the result

Every statistical analysis involves choices. What counts as a "match"? Which events do you include in your catalog? What margin of error do you allow? Do you test only the summer solstice, or all four cardinal days? Do you use all six animal carvings, or just the best ones?

None of these choices are obviously wrong. But each one nudges the final number.

I tested 128 different combinations of these choices to see how much the result moves. For example: if you test all four cardinal days instead of just the summer solstice, the date-matching probability changes. If you only include major catastrophes instead of all 14 events, it changes again. If you use a tighter or looser time window (±100 years vs. ±250 years vs. ±500 years), it shifts further.

The result: depending on which combination of defensible choices you make, the significance ranges from 1 in 10 million to 1 in 111,000. That's a hundredfold range. Same data, same carvings, same constellations — just different reasonable methodological decisions.

Sweatman's specific combination — summer solstice only, all events included, ±250 year window, all 6 animals — gives him 1 in 1.27 million. But 42% of the primary methodological paths — varying only defensible analytical choices, not dropping animals — produce a result at least as impressive as his. He isn't reporting the significance of the evidence. He's reporting the significance of one particular path through a garden of forking choices, without disclosing how many other paths he could have taken.

What I'm NOT saying

I am not saying Sweatman is wrong. Three of his six animal-constellation matches are strikingly good, and I haven't found a satisfying explanation for why scorpion, wolf, and bird-with-snake all rank 1st against 44 options.

I am not saying the carvings can't be constellations. They might be.

I am not saying Göbekli Tepe isn't remarkable. It's one of the most important archaeological sites on Earth.

What I am saying: the statistical argument, as published, is weaker and more conditional than the headline number suggests. The 1 in 1.4 million substantially overstates the evidence. The date part contributes almost nothing because the era is so packed with catastrophes. The visual part is concentrated in three self-scored rankings that have never been tested by anyone else. And the final number shifts by orders of magnitude depending on choices the researcher makes but doesn't disclose.

The visual signal remains statistically surprising even after conservative adjustments. But its meaning depends entirely on accepting that these carvings are legitimately being compared to the adopted constellation set. If that interpretive step is wrong — if the carvings are just animals in a funerary scene — then the p-value loses its force, no matter how small it is.

What would help settle this

The most obvious next step would be a blind ranking experiment: show the animal carvings to people who know nothing about Göbekli Tepe or archaeoastronomy, ask them to match each animal to its most similar constellation figure, and see if scorpion, wolf, and bird-with-snake still come out on top without any prompting.

That would test whether Sweatman's rankings are reproducible by independent observers.

But even a perfect blind replication wouldn't fully settle the question. If everyone agrees the scorpion carving looks like Scorpius, that might just confirm what we already know: scorpions look like Scorpius because the Greeks named it after a scorpion. It wouldn't prove that Neolithic people were mapping the sky.

The carvings might be constellations. They might be animals in a funerary scene, exactly as the excavation team believes. The statistical evidence, as it stands, cannot distinguish between these possibilities as cleanly as Sweatman claims. A 1 in 1.4 million headline makes it sound like the debate is settled. It isn't.

All code, data, and figures are open source:

→ GitHub: github.com/simossss/gobekli-pillar43-statistics

→ Paper: [Zenodo DOI]

04 / How I used AI

This project was built entirely with AI systems in distinct roles. Claude handled technical reasoning, statistical design, code development, and manuscript drafting. ChatGPT served as an adversarial reviewer across roughly 10 review cycles, catching real errors each time — incorrect significance framing, a flawed negative control experiment, an invalid combined p-value, and inconsistent multiverse reporting. The analysis ran on an NVIDIA DGX Spark.

The script grew from 4 experiments to 13 through iterative conversation, not a predefined plan. Each result raised a new question, and each review cycle tightened the methodology. The decision not to run a blind visual matching study — and the three-layer argument that replaced it — came out of a conversation about why such a test would be inherently ambiguous.

My role was scoping, sequencing, and final judgment. I decided which questions to ask, which reviewer suggestions to accept, which framing to use, and when to stop. AI provided the throughput. I provided the direction.

Full process writeup →

05 / Sources
06 / External links
← back to index © punkjazz.ai