More

funnymony · on Oct 21, 2023

Performance implication exist, but it is secondary.

Primary reason: distinct on every select shows either lack of knowledge of schema, in particular which columns make rows unique, or unfortunate schema design. (Apart from niche cases, schema should be somewhat normal. I.e. column parent_name belongs in the table parent, not in the table student)

Select a from x where myuniquekey=1; —- guaranted to return 1 or zero rows, if myuiniquekey is actually unique.

Select a from x join y on x.parent_id = y.y_id —- guaranteed to return same amount of rows as exist in y, never more, never duplicates y rows. (N-to-1 relation)

If distinct is used in any of above, then question “why?” naturally arises.

In more severe case, leads to bugs:

Select distinct student.student_name, parent.parent_name from student join parent on student.parent_id = parent.parent_id —- silently discards rows, where by accident student/parent name combo matches several times.

Technically sql allows comparing unrelated columns (colour=last_name), but for vast majority of cases, when joining, one of the side should be joined using it’s unique key, and other side should be using it’s foreign key, which ensures that duplicates don’t appear randomly, and thus distinct is not needed.

munch117 · on Oct 21, 2023

> If distinct is used in any of above, then question “why?” naturally arises.

Not if distinct is the default.

> Select distinct student.student_name, parent.parent_name from student join parent on student.parent_id = parent.parent_id —- silently discards rows, where by accident student/parent name combo matches several times.

Either with or without distinct can be a bug depending on what you are doing it for.

There are actually 4 variations on what you might want, and you can get all of them with distinct:

  select distinct student.student_id, student.student_name, parent.parent_id, parent.parent_name from ...
  select distinct student.student_name, parent.parent_id parent.parent_name from ...
  select distinct student.student_id, student.student_name, parent.parent_name from ...
  select distinct student.student_name, parent.parent_name from ...

magicalhippo · on Oct 21, 2023

Our main application at work is essentially a CRUD application, and I've worked on it for over 10 years now. I'm fairly confident I can count on one hand the number of cases where a join returned unexpected duplicates which DISTINCT would "fix".

Sometimes I wonder if we're just weird, somehow avoiding this issue.

funnymony · on Oct 21, 2023

These examples reminded me one more issue: change in column selection, might change number of rows, which means column addition/removal is so much riskier afair.

> Not if distinct is the default.

If that works for you, great, but let’s agree to disagree here.

munch117 · on Oct 22, 2023

We have different mental models.

Your mental model, if you will forgive the straw man, is that SELECT over multiple tables is conceptually equivalent to nested for-loops over each table, and the WHERE condition is an if-statement.

My mental model is that I'm working with sets. If yesterday I asked for the set of CITY,COUNTRY, and today I've changed that to the set of COUNTRY, then obviously the result set today is going to be much smaller. This is not a risk to me -- asking for a different set gives me a different set, I can't imagine being surprised by that.

funnymony · on Oct 21, 2023

Its a matter of your eyes being used to this.

If for ten years you always indented the code this way

    Void F()
    {    Foo();
         Bar();
         Baz();}

Then following snippet will seem hard to parse mentally:

    Void F()
    {
         Foo();
         Bar();
         Baz();
    }

And vice versa

gigatexal · on Oct 21, 2023

You mean the where exists/not exists concept? Or the select 1 usage in that correlated subquery?

funnymony · on Oct 21, 2023

funnymony · on Oct 18, 2023

Somewhat relevant. Estonia has a law that financial instutions should have servers located on Estonian soil. (Exactly for this kind of scenario)

sandworm101 · on Oct 18, 2023

A great idea in principal, but having a local server doesn't necessarily mean said server will continue functioning once severed from the rest of the internet. Things like DNS and timing signals are an issue. Until it is tested, and retested after every update, I wouldn't trust any financial server to keep ticking along once so disconnected.

jjtheblunt · on Oct 18, 2023

perhaps misunderstanding, but you could get time signals from GPS (or GLONASS)

traceroute66 · on Oct 18, 2023

> you could get time signals from GPS (or GLONASS)

You forgot to mention Galileo ... GLONASS may not be politically attractive :)

But beyond the GNSS ecosystem, there are of course other interesting options:

Safran STL[1] which uses LEO sats. Apparently it works well in places where you can't get a good GNSS signal (i.e. indoors without an external antenna). (This was previously Orolia STL, but they were acquired by Safran).

Most national time labs also offer a leased-line service, e.g. NPL (UK)[2]

There is also a very niche (read: VERY expensive) commercial timing-as-a-service product from a company called Hoptroff[3]. You license both the service/connection plus their software clients, so the $$$$$$ add up. Definitely one of those "if you have to ask you can't afford it" vendors.

[1] https://safran-navigation-timing.com/product/stl-option [2] https://www.npl.co.uk/npltime [3] https://www.hoptroff.com/

jjtheblunt · on Oct 18, 2023

Galileo too! (Pre coffee oversight)

And, recursively, from celltower signals for any CDMA-related technology, i think.

jazzyjackson · on Oct 18, 2023

sure, just a matter of what servers are configured to do ahead of time and what happens when assumptions fail - are the servers tested against a situation where they can't get the time as expected etc

c_o_n_v_e_x · on Oct 18, 2023

What happens if/when your GNSS constellation operator intentionally degrades quality (ex: GPS & selective availability)? Do you fall back to a local atomic clock assuming one is available?

senectus1 · on Oct 18, 2023

man, the more i hear about Estonia the more impressed i am. Their IT policy decisions have been amazing.

spiralx · on Oct 20, 2023

And they've open sourced a lot of the code they use for their infrastructure:

https://github.com/nordic-institute/X-Road

They also had a blockchain-based timestamping system a year before the Bitcoin white paper was released :)

funnymony · on Oct 14, 2023

Link to exact section: https://dlang.org/spec/function.html#scope-parameters

MaxBarraclough · on Oct 14, 2023

Good call, have updated my links.

funnymony · on Oct 7, 2023

This would be like learning everything from books vs learning by experience. (Can’t learn skiing from books. I talk from experience)

bch · on Oct 8, 2023

Clever - reminds me of:

In school, we learn the lesson then get the test. In life we get the test then learn the lesson.

funnymony · on Oct 4, 2023

> deserialization is validation

I saw similar, catchy phrase: “parse, don’t validate”.

https://lexi-lambda.github.io/blog/2019/11/05/parse-don-t-va...

funnymony · on Sept 1, 2023

Interesting idea. But this would require no intermixing between groups.

funnymony · on Aug 19, 2023

Problem is that despots want power, whatever system you have you will get despots wanting to be at the top, and doing what is necessary to get there. Democracy at least moves them around a bit.

nottorp · on Aug 19, 2023

One interesting fact in the article is that for the top spots at least, you did it once for a year and then you were barred from that office for like 10.

funnymony · on Aug 14, 2023

I’ve read something like: there are bunch of observations of extremely dense masses in various suroundings. Talented people can find explanations other than black holes in each case. But that feels adhoc, and black hole fits quite nicely for whole class of observations. Current consensus is that b holes exist.

claytongulick · on Aug 15, 2023

> Current consensus is that b holes exist.

I don't think it works like that?

My read from looking at the literature is that the feeling is more like "black holes are a mathematical theory that have not been falsified by observational evidence, and for which we have some compelling, but inconclusive data, reconstructed from intensely noisy and poor resolution sources. Seems likely, but more data needed."

andrewflnr · on Aug 15, 2023

If you're sufficiently cynical, you can describe almost all astronomy results that way. Lots of alternate explanations for the observations have been proposed, and fail to match the full evidence.

Just how much evidence are you going to demand of phenomena hundreds of lightyears away before you say, "yeah, that's probably what it is"? Anyway, do be sure to read the whole list of observations linked above by csours, as there are quite a few.

funnymony · on July 15, 2023

“bad” might not be the right word. Maybe better “usefulness for current population of developers is very limited”.