Wikipedia talk:Disambiguation

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
WikiProject iconDisambiguation
WikiProject iconThis page is within the scope of WikiProject Disambiguation, an attempt to structure and organize all disambiguation pages on Wikipedia. If you wish to help, you can edit the page attached to this talk page, or visit the project page, where you can join the project or contribute to the discussion.

Switching to an -ing for in the middle of a disambiguation list[edit]

 – Pointer to relevant discussion elsewhere.

Please see WT:Manual of Style/Disambiguation pages#Switching to an -ing for in the middle of a disambiguation list.  — SMcCandlish ¢ 😼  21:24, 21 February 2024 (UTC)[reply]

A specific question on disambiguation[edit]

I recently promoted Therapy Dogs (2022 film) from AfC to mainspace. I would like to disambiguate Therapy Dogs (currently a redirect to Therapy dog) but I am not sure if the best action would be to create a disambiguation page at Therapy Dogs, linking the film and the dog, or a hatnote at Therapy dog. I tried to follow the guideline but I'm not sure how to interpret it in this case. Broc (talk) 11:40, 28 February 2024 (UTC)[reply]

There seems no reason for the capitalised plural to redirect to Therapy dog, and none was offered when the redirect was created, nor any Rcat added to explain it, so I would suggest a Move Request to move the film to Therapy Dogs, with hatnotes from film to dog and v.v. PamD 14:57, 28 February 2024 (UTC)[reply]
Yes, the policy WP:SMALLDETAILS seems relevant for considering "Therapy Dogs" being the film. —Bagumba (talk) 17:09, 28 February 2024 (UTC)[reply]

on what statistics should look like for hatnotes, primary redirects, primary topics[edit]

Here's some more bits of info I've gathered after someone asked at Talk:Tupelo:

  • Talk:Slow#followup to move
    • hatnote got up to 50% of traffic
    • after the move, the previously presumed primary topic barely registers though the common section does consistently get a measurably noticable chunk of traffic
  • Talk:Panis#post-move
    • a primary redirect was in place and no hatnote
    • after the move, the previous presumed primary topic got ~1.2%, ~1.4%, ~2.4%, ~1.9%
  • Talk:Zozo#post-move
    • hatnote got a small amount of traffic but almost all identifiable traffic anyway; RM received very little interest
    • after the move, the previous presumed primary topic got ~2.6%, ~1.9%
  • Talk:Mar#post-move
    • hatnote got ~5.2% compared to incoming traffic
    • after the move, the previous presumed primary topic got ~6.5%, ~4.3%, ~4%, ~2.9% (the latter being a weird month)
  • Talk:Julius#followup to move
    • a primary redirect was in place before, hatnote got ~3.5% of traffic, we noted the redirect ratio back then as 68/91 = ~75%
    • after the move, the previous redirect destination got ~11%, ~5%, ~3%, ~4%
  • Talk:Sola#followup to move
    • hatnote got ~10% of traffic
    • after the move the previous presumed primary topic got ~7%, ~7%, and then with the fall of overall traffic it was below the anonymization threshold (< 10/273 = < 3.66% with every source-destination combination - which doesn't mean it wasn't actually something different though still a minority).
  • Talk:Tete#post-move
    • the hatnote got ~11% before the move
    • afterwards the previously presumed primary topic gets ~6%, ~7.2%, ~8.2%
  • Talk:Uma#post-move
    • there was a primary redirect in place and the hatnote got ~25.3% of its traffic before the move and dis. traffic was much larger still
    • afterwards the previously presumed primary topic gets ~8.2% interest
  • Talk:Ty#post-move
    • hatnote got a very small amount of traffic (max 220/13742 = ~1.6%), was only #5 in the top list
    • after the move, the previously presumed primary topic was sorted down in the list, got ~7% clicks the next month; sorted back up the following month it got ~8%, ~7.5% clicks, though ~53% / ~55% / ~60% of identifiable outgoing
  • Talk:Hum#full disambiguation again
    • used to be primary redirected to human humming, went back and forth a bit
    • previous primary redirect destination gets ~10%, ~8%, ~8%
  • Talk:Marr#post-move
    • hatnote got ~10% of traffic
    • after the move the previous presumed primary topic got ~11%, ~10.5%, ~11%
  • Talk:Cam#post-move
    • hatnote got ~3.3% of traffic
    • after the move the previous presumed primary topic got ~11.5%, ~8.6%
  • Talk:Luz#post-move
    • hatnote got ~9.7% of traffic, total disambiguation list traffic at times substantially higher as well
    • after the move the previous presumed primary topic got ~12%, ~7%, ~7.7% among several other topics
  • Talk:Kolya#post-move
    • hatnote got ~1.4%, total disambiguation list traffic substantially higher than that
    • after the move the previous presumed primary topic got ~13%, ~11%, ~8%
  • Talk:Shoot#post-move
    • hatnote got ~1% of traffic
    • after the move, the previously presumed primary topic got ~11%, ~10%, ~11%, ~12%, ~14.5% interest
  • Talk:Boyle#post-move
    • hatnote got <4% of traffic
    • after the move, the previously presumed primary topic got ~15.6%, ~33%, though it's not clear if this is an actual improvement
  • Talk:Bold#post-move
    • the previous stats indicated up to ~20% interest in the hatnote
    • after the move the interest in the previously presumed primary topic was ~16%, ~14.7%, ~18.4%
  • Talk:Julia#post-move
    • hatnote got ~20% of traffic
    • after the move, the previously presumed primary topic got ~14%, ~16%, ~17%
  • Talk:Slava#post-move
    • disambiguated after 18 years of being presumed primary topic, 15 years with a hatnote (March '22 1199/9356 = ~13%)
    • after the move, we continue seeing seasonal spikes for the previously presumed primary topic, and regardless of spikes it gets ~23%, ~15%, ~11% interest
  • Talk:Hamme#post-move
    • hatnote got relatively small traffic, was not measurable, most likely <10%
    • after the move, the previously presumed primary topic got ~16% interest
  • Talk:Baudouin#post-move
    • primary redirect, hatnote got < 0.5% interest compared to total incoming traffic at destination, nobody checked a comparison of redirect and disambiguation traffic
    • after the move, the previously presumed primary topic got ~16%, ~23%, ~17% interest
  • Talk:Warren#followup to move
    • hatnote got ~7% of traffic
    • after the move, the previously presumed primary topic got ~18%, ~12%, ~14%, ~15% interest
  • Talk:Hamm#followup to move
    • hatnote got ~2.5% of traffic
    • after the move, the previously presumed primary topic got ~18.4% interest
  • Talk:Mana#post-move
    • hatnotes combined got ~8% of traffic
    • after the move, the previously presumed primary topic got ~19%, ~17%, ~16% interest
  • Talk:Charlotte#post-move
    • the hatnote got ~0.7% before the move, yet there were hints that it could do better if reorganized because of ~20% measurements related to the primary redirect (which we sadly can't have in a lot of cases)
    • afterwards the previous presumed primary topic gets ~21%, ~23.5%, ~21.1%
  • Talk:Lili#post-move
    • the hatnote got ~2% before the move
    • after the move, the previously presumed primary topic got ~23.2%, ~15.4%
  • Talk:Gaga#post-move
    • the hatnote got ~1.5% before the move
    • after the move, the previously presumed primary topic got ~24.8% interest
  • Talk:Saba#post-move
    • hatnote was in the top 5 of identifiable outgoing clickstreams, the ratio wasn't recorded then but was around 635/23806 = ~2.6%
    • after the move, the previously presumed primary topic got ~28.5%, ~26%, ~24%, ~23.5%, though around half of outgoing
  • Talk:Frida#post-move
    • hatnote got ~2% of traffic
    • after the move, the previously presumed primary topic got ~27%, ~19%, ~27%, ~24.2%
  • Talk:Forced march#post move to disambiguation
    • was a soft redirect between 2011 and 2015, when it was made a primary redirect, and in 2023 it was disambiguated after a discussion
    • previous primary redirect destination gets ~29%, ~27%, ~44%, ~34.1%, another meaning that wasn't even mentioned in the RFD gets much more
  • Talk:Major#post-move
    • the previously discussed primary topic was moved after a lot of discussions, at the time of the RM the hatnote had max 431/17909 (~2.4%) interest
    • afterwards it shows up with ~30.6%, ~30.3%, ~25%, ~28%, ~21.3% of reader interest, though most outgoing
  • Talk:Ultra#post-move
    • the hatnote got ~2% before the move
    • afterwards the previously presumed primary topic gets just ~29%, ~30.6%, ~26.5%
  • Talk:San Juan#followup to move discussion
    • before the discussion, the most popular link is alphabetically sorted in the middle of a big list and got ~22% clickstreams
    • after the discussion, it's on top of a MOS:DABCOMMON section, and gets ~29%, ~38%, ~26%
  • Talk:Severian#post-move
    • hatnote got ~1.7%
    • after the move, previously presumed primary topic got ~37%, ~33%, ~31%
  • Talk:Tiro#post-move
    • primary redirect was in place and we could measure ~18% of interest in the hatnote
    • after the move, previously presumed primary topic got ~36%, ~28%, ~36%
  • Talk:Mons#post-move
    • hatnote got ~2%
    • after the move, previously presumed primary topic got ~38.5%, ~33.3%
  • Talk:Angles#followup to move discussion
    • with primary topic in place, hatnote got less than 0.5% compared to incoming traffic and wasn't even in top 20 clickstreams
    • after the move, the previously presumed primary topic gets ~46.5%, ~43.6%, ~43.7%, ~42%
  • Talk:Lord Cameron#followup to move discussion
    • proposed primary redirect quite recent, no consensus
    • proposed primary topic later got ~47%, ~54.8%, ~36.3% of incoming traffic
  • Talk:Erasure#post-move
    • hatnote got < 0.3% interest, I myself doubted that there's a navigation issue there, but it wasn't completely clear
    • after the move, the previously presumed primary topic got ~48%, ~47%, ~44% interest
  • Talk:AEG#post-move
    • hatnote got ~1% of traffic, I was skeptical because of that and long-term significance aspects
    • after the move the previous presumed primary topic got ~49%, ~47.5%, though ~90% of identifiable outgoing
  • Talk:Motul#followup to move discussion
    • proposed primary topic gets ~51%, ~49%, ~52%, ~48%, ~51%
    • no consensus to move
  • Talk:Sugar Man#post-move
    • was a primary redirect, the ratio between redirect views and disambiguation views was between ~23% and ~43% but then mostly going back
    • after the move the previous redirect destination got ~60%, ~61.5%
  • Talk:Heavy metal music/Archive 13#Requested move 12 June 2023
    • proposed primary topic got ~60% of traffic, though a fair bit of of it came from there, too
    • the discussion was overwhelmingly against the move regardless
  • Talk:Sean O'Malley (fighter)#Requested move 5 March 2024
    • proposed primary topic got ~71% or ~84% of traffic, one other topic identifiable as well
    • no consensus
  • Talk:Michael Fagan#Requested move 17 February 2024
    • proposed primary topic got between around 70 to around 85 percent of traffic for years and across spikes in usage
    • no other major topics by long-term significance, consensus to move by usage

I think I'll have to keep updating this summary here to build up a knowledge base. --Joy (talk) 15:19, 1 March 2024 (UTC)[reply]

--Joy (talk) 14:44, 6 April 2024 (UTC)[reply]

This isn't to say that all of these moves were truly warranted or that there aren't a plethora of individual factors at play. But even with this spread of outcomes, there's something distinctly off with our current near-consensus interpretation of how stats should look like for primary topics by usage. This also means little for considerations of long-term significance. --Joy (talk) 15:30, 1 March 2024 (UTC)[reply]

Your numbers for % of "interest" are rather misleading, as they don't mention that in many cases, the largest percentage by far was "no traffic at all", i.e. no clickthrough, for whatever reason (visits not by humans? wanted target not available? info on disambig page sufficient?). E.g. for Hamme, you note "after the move, the previously presumed primary topic got ~16% interest", which was more than 4 times as much as the other topics "combined". Fram (talk) 14:51, 11 April 2024 (UTC)[reply]
Yes, I've mentioned this several times. I agree it is misleading to simply assume that every incoming view counts for something meaningful. Truth is that there is no way for us to know what if anything those 'dead-end' incoming page views signify. olderwiser 15:34, 11 April 2024 (UTC)[reply]
@Bkonrad exactly, but it's not exactly that we don't know that they don't signify anything. There's multiple hints that they do:
First of all, there is a spread of cases here, ranging from where we see a lot of the incoming views translate into clicks, to where we see few of the incoming views translate into clicks. I didn't summarize all of that information on this talk page, you have to go click through the links to examine that. (I may find some time later to extract that dimension of data and extend the list above.)
That means that we're not just consistently seeing some ghost traffic always, rather, we've got to be observing actual reader behavior at least to an extent. So we can't just see e.g. 60% of traffic translate into clicks in a fresh case and then jump to the conclusion that most of the remaining 40% is ignorable.
Secondly, there are cases where we see almost all of the incoming views translate into clicks. The most recent such example I found is described at Talk:Forced march#post move to disambiguation, where our identification rate went from 34/55 (~62%) in the first month observed, to 96/96, to 95/95, to 135/135 in the last three months, amazingly enough, even at such a small amount of traffic.
This negates even the idea that there's always got to be at least some of this ghost traffic, because apparently we have a falsifying scenario that seems quite consistent. So we can't just see e.g. 75% of traffic translate into clicks and then jump to the conclusion that any part of the remaining 25% is ignorable.
--Joy (talk) 12:33, 14 April 2024 (UTC)[reply]
On the individual points raised by @Fram earlier:
  • meta:Research:Wikipedia clickstream says it tries to exclude visits not by humans: We attempt to exclude spider traffic by classifying user agents with the ua-parser library and a few additional Wikipedia specific filters. It's certainly possible that it misses, but then the page views "User" category is likely missing, too, so I don't know that we should rely on that being a major effect.
  • Wanted target not available - how would this improve the odds for the claim that there would have to be a primary topic, when there'd be topics that detract from there being a primary topic yet they're not even available? That would seem to just raise the risk of astonishing more contingents of readers. "These people don't even know about meaning X, and they proclaimed meaning Y as the main one - pfft!"
  • Info on disambig page sufficient - this use case is indeed not studied at all, and I agree that it seems possible for at least some cases. Ultimately, why would we consider all navigations that do not result in another click bad? IOW surely this also detracts from the idea of there being a primary topic, if there is also the contingent of readers who we cannot convince to click on the link to read about the proposed primary topic (which is also usually the very first link in the list).
  • More than 4 times as the other topics combined - I've explained already at Talk:Hamme (disambiguation) how you are making weirdly incorrect statements. Even if we compare 28 identified clickstreams and the 10 identified clickstreams, the ratio between those two numbers is 2.8, it is simply not 4. Likewise, both 28 and 10 are so close to the anonymization threshold that it's not at all clear that this ratio has to be precise. In other words, this could have been 4 or it could have been 2 with just a few more src-dest pairs of views identified as opposed to anonymized out. And none of these ratios are impressive when we also see a lot more traffic interested in neither of these.
In any event, thanks for the interest. --Joy (talk) 12:33, 14 April 2024 (UTC)[reply]
Yes, I'm not saying the count of incoming views should be completely ignored, only that trying to read into what it signifies is highly speculative and we should be very cautious about what significance we attribute to such dead-end views. If there is a sudden change in the number of such incoming views, that likely merits some further consideration. Similarly, if there is a consistent, very large gap between incoming and outgoing views, that also may merit some consideration. But even in such cases, deciding what readers reaching such dead-end views were looking for when arriving at a particular disambiguation page is still highly speculative. It could play a factor in arguing that there is no primary topic where there is none at present (i.e., where there is a request to replace a disambiguation page with a primary topic). I'm not sure what significance we could read into such dead-end views of a disambiguation page where there is an existing primary topic. It could be readers are just curious about what else might have the same name, without intending to look at any of them in more detail. We just don't know why such readers behave that way. olderwiser 13:46, 14 April 2024 (UTC)[reply]
Agreed, the change in pattern would be a significant indicator. But on that front, I point again to evidence above - we often observe a clearly consistent pattern, and then we do a switch for whatever reason, and then we the data switches to observing a clearly different consistent pattern. Well, it often takes a few months for things to settle, and in the interim period there's a swing or two, but still.
In cases where there is already a primary topic selected, it's very hard to read into the no-clickthrough traffic. Because the content is larger and varied, it could be any number of possibilities. Just like it could be readers who are navigated wrongly and just immediately click away, it could also be misnavigated readers who stayed and learned something and then clicked away, or it could be a bunch of completely content readers who were absolutely happy to read what was in front of them and had no need to immediately learn more about another related topic. We don't have the tools to discern these.
With simpler pages like the disambiguation lists, however, it's less hard to understand the general reader behavior because we don't present people with huge amounts of possibilities, we reduce that number and streamline their options, and make it more likely we can understand the measurements of our existing tools.
What I think we should learn from all this is that we should not be too cautious and instead we should not be afraid to experiment as much as we have been so far.
In all this data I've tracked, we've yet to observe a case where there was a fresh reader complaining about disambiguation lists being the wrong choice. As long as we apply MOS:DABCOMMON, and we do, we have no indication that we're confusing or troubling any appreciable amounts of readers even in contentious cases. --Joy (talk) 14:35, 14 April 2024 (UTC)[reply]
In all this data I've tracked, we've yet to observe a case where there was a fresh reader complaining about disambiguation lists being the wrong choice. This seems a peculiar criterion. Quite aside from reactions to the lists you have been compiling, I can't recall the last time I came across a "fresh" reader ever complaining about an incorrectly placed disambiguation page where the complainant was not a myopic partisan seeking to promote their preferred topic.
Regarding What I think we should learn from all this is that we should not be too cautious and instead we should not be afraid to experiment as much as we have been so far. I'm glad you are taking a deeper dive into the data, but I hope no one is being misled that the reems of data of uncertain quality based on poorly documented functions represents an agreed upon approach to making decisions. olderwiser 16:02, 14 April 2024 (UTC)[reply]
We actually have some interesting data points about that, too, cf. Talk:Tito (disambiguation), where nobody really paid attention for over a decade as disambiguation was in place, and then a consensus of editors practically instantly chose to apply a primary topic redirect mainly for long-term significance. (On the plus side, that flip allowed us to measure something else afterwards, Wikipedia talk:Disambiguation/Archive 56#on the quality of clickstream and pageviews usage data explains more.)
I'm pretty sure if we go through other cases we can also find similar timeframes, where some arbitrary navigation choice has been in place for years and decades, and then we arbitrarily decide to congregate, make fun new decisions and pat ourselves on our collective backs :)
IOW our decision-making process seems perfectly sound (mostly to me too, I'm not excluding myself here), but so much goes through the cracks that it's doubtful that much of it really matters as much as we think it does. --Joy (talk) 16:49, 14 April 2024 (UTC)[reply]

The redirect DAB page has been listed at redirects for discussion to determine whether its use and function meets the redirect guidelines. Readers of this page are welcome to comment on this redirect at Wikipedia:Redirects for discussion/Log/2024 March 2 § DAB page until a consensus is reached. Jay 💬 16:06, 2 March 2024 (UTC)[reply]

In the discussion linked above, we have something of a weak consensus to stop strictly sequestering disambiguation from set indices in all cases. Does anyone see any reason not to draft changes to this guideline to incorporate some of these possibilities? --Joy (talk) 15:44, 6 March 2024 (UTC)[reply]

a change in page views between primary topic and primary redirect[edit]

A few years ago, I had noticed Sumber, Cirebon was at "Sumber" and moved it away, but kept a primary redirect because it wasn't clear if there was not a primary topic. Now that I had a look at the graph of monthly page views it looks like that change significantly altered the overall traffic at "Sumber" - it went from an average of >200 a month to <20 a month.

This difference of an order of magnitude seems to be another confirmation for how our choices about navigation influence traffic in unforeseen ways - it looks like if we put something in a presumed primary topic position, the search engines drive traffic there way more than they do otherwise, even with a primary redirect in place. This sort of puts a significant dent in our logic of figuring out primary topic by usage - once we have a presumed primary topic, we can't really trust our statistics about that to tell a straightforward story.

Now, this property of the system could have both positive and negative effects - maybe we should think of this in terms of: by choosing to put a topic in the primary topic position, we can intentionally drive traffic to it and more effectively contribute to the spreading of knowledge. At the same time, this indicates a need to have more substantive long-term significance discussions, because we if we have a chance to influence reader traffic like this, we want to make sure we do it for the right topics, as opposed to doing it arbitrarily and/or effectively hiding ambiguity. --Joy (talk) 10:16, 9 March 2024 (UTC)[reply]

I just found another case of this, just in the other direction: Talk:Bosut (disambiguation)#primary topic. --Joy (talk) 10:28, 9 March 2024 (UTC)[reply]

At Talk:Bold#post-move, we see a change from primary redirect to disambiguation page leading to significantly more traffic at the latter. --Joy (talk) 21:20, 14 March 2024 (UTC)[reply]

What would be the best way to name this article, given that we have Fifth Avenue Hotel for a different (former) hotel on the same avenue? I don't think using the word "The" is the correct way — Martin (MSGJ · talk) 11:31, 5 April 2024 (UTC)[reply]

Options include using the date it opened, i.e. The Fifth Avenue Hotel (opened 2023) or the street it is on, i.e. The Fifth Avenue Hotel (28th Street) — Martin (MSGJ · talk) 11:33, 5 April 2024 (UTC)[reply]
As long as the correct title is The Fifth Avenue Hotel in accordance with WP:THE, then WP:SMALLDIFFS kicks in, and we needn't add anything further to the title. All we need is hatnotes on each article pointing to the other. Station1 (talk) 03:55, 6 April 2024 (UTC)[reply]

RDAB speedy criteria[edit]

See Wikipedia talk:Criteria for speedy deletion#RFC new R5 for a proposal to make RDAB errors a speedy criteria. Crouch, Swale (talk) 19:09, 6 April 2024 (UTC)[reply]