ChatGPT解决这个技术问题 Extra ChatGPT

Why does Oracle 9i treat an empty string as NULL?

I know that it does consider ' ' as NULL, but that doesn't do much to tell me why this is the case. As I understand the SQL specifications, ' ' is not the same as NULL -- one is a valid datum, and the other is indicating the absence of that same information.

Feel free to speculate, but please indicate if that's the case. If there's anyone from Oracle who can comment on it, that'd be fantastic!

Feel free to speculate? Somehow I don't think that will provide you with the greatest set of answers..
I suppose not, but I wasn't sure there'd be any certainty on the subject, so I figured I'd throw open the doors. Seems to have worked out okay, so far.

J
Justin Cave

I believe the answer is that Oracle is very, very old.

Back in the olden days before there was a SQL standard, Oracle made the design decision that empty strings in VARCHAR/VARCHAR2 columns were NULL and that there was only one sense of NULL (there are relational theorists that would differentiate between data that has never been prompted for, data where the answer exists but is not known by the user, data where there is no answer, etc. all of which constitute some sense of NULL).

By the time that the SQL standard came around and agreed that NULL and the empty string were distinct entities, there were already Oracle users that had code that assumed the two were equivalent. So Oracle was basically left with the options of breaking existing code, violating the SQL standard, or introducing some sort of initialization parameter that would change the functionality of potentially large number of queries. Violating the SQL standard (IMHO) was the least disruptive of these three options.

Oracle has left open the possibility that the VARCHAR data type would change in a future release to adhere to the SQL standard (which is why everyone uses VARCHAR2 in Oracle since that data type's behavior is guaranteed to remain the same going forward).


B
Brian

Tom Kyte VP of Oracle:

A ZERO length varchar is treated as NULL. '' is not treated as NULL. '' when assigned to a char(1) becomes ' ' (char types are blank padded strings). '' when assigned to a varchar2(1) becomes '' which is a zero length string and a zero length string is NULL in Oracle (it is no long '')


Wow, Tom's pretty snarky. Given that the questions are pertaining to an egregious divergence from SQL92, you'd think he'd be less punchy about it... although he might be tired of answering.
The best thing about Tom is that you get a clear answer, which states exactly what he thinks. Look for some of the comments where people have used text speak on Ask Tom
But it would be more precise if the second line was changed to '' is not always treated as NULL.
@ypercube The quote does not get more precise by changing the word actually used by Tom. If you think Tom worded it confusingly, mmm. Maybe. I think he's spot on. The most confusing situations arise when '' is being implicitely converted to a VARCHAR2, such as cast('' as char(1)) is null which is... surprisingly TRUE
@sehe the confusing bit for me is select 1 from dual where ('' is null)
W
Walter Mitty

Oracle documentation alerts developers to this problem, going back at least as far as version 7.

Oracle chose to represent NULLS by the "impossible value" technique. For example, a NULL in a numeric location will be stored as "minus zero", an impossible value. Any minus zeroes that result from computations will be converted to positive zero before being stored.

Oracle also chose, erroneously, to consider the VARCHAR string of length zero (the empty string) to be an impossible value, and a suitable choice for representing NULL. It turns out that the empty string is far from an impossible value. It's even the identity under the operation of string concatenation!

Oracle documentation warns database designers and developers that some future version of Oracle might break this association between the empty string and NULL, and break any code that depends on that association.

There are techniques to flag NULLS other than impossible values, but Oracle didn't use them.

(I'm using the word "location" above to mean the intersection of a row and a column.)


Oracle documentation warns database designers and developers that some future version of Oracle might break this association between the empty string and NULL, and break any code that depends on that association – could you please provide reference for this statement?
u
user67897

I suspect this makes a lot more sense if you think of Oracle the way earlier developers probably did -- as a glorified backend for a data entry system. Every field in the database corresponded to a field in a form that a data entry operator saw on his screen. If the operator didn't type anything into a field, whether that's "birthdate" or "address" then the data for that field is "unknown". There's no way for an operator to indicate that someone's address is really an empty string, and that doesn't really make much sense anyways.


That only makes sense if you assume that every field in a data entry system is mandatory. A non-answer to a non-mandatory field (e.g. "Dog's Name") is valid, so an empty string still has a distinct purpose from NULL. Even with that assumption in place, I doubt early developers thought of Oracle as a "glorified backend for a data entry system" so I'm not sure this answer makes sense at all.
S
Sorter

According to official 11g docs

Oracle Database currently treats a character value with a length of zero as null. However, this may not continue to be true in future releases, and Oracle recommends that you do not treat empty strings the same as nulls.

Possible reasons

val IS NOT NULL is more readable than val != '' No need to check both conditions val != '' and val IS NOT NULL


In a fully ANSI-compliant database, you don't have to check for both conditions. val <> '' already excludes NULL. Perhaps you meant val = '' OR val IS NULL. But empty strings that don't compare as NULL are useful!
I agree with the comparision part.
A
Alex Kreutznaer

Empty string is the same as NULL simply because its the "lesser evil" when compared to the situation when the two (empty string and null) are not the same.

In languages where NULL and empty String are not the same, one has to always check both conditions.


Simply set not null constraint on your column and check only on empty string.
Checking both conditions is trivial: WHERE Field <> '' returns true only if the field is not NULL and not empty, on databases with ANSI behavior for empty strings.
By having them same you have a uniqueness problem because NULL !== NULL, but '' === '', so I'd argue it's not the lesser evil. It's adding confusion.
z
zloctb

Example from book

   set serveroutput on;   
    DECLARE
    empty_varchar2 VARCHAR2(10) := '';
    empty_char CHAR(10) := '';
    BEGIN
    IF empty_varchar2 IS NULL THEN
    DBMS_OUTPUT.PUT_LINE('empty_varchar2 is NULL');
    END IF;


    IF '' IS NULL THEN
    DBMS_OUTPUT.PUT_LINE(''''' is NULL');
    END IF;

    IF empty_char IS NULL THEN
    DBMS_OUTPUT.PUT_LINE('empty_char is NULL');
    ELSIF empty_char IS NOT NULL THEN
    DBMS_OUTPUT.PUT_LINE('empty_char is NOT NULL');
    END IF;

    END;

e
erikkallen

Because not treating it as NULL isn't particularly helpful, either.

If you make a mistake in this area on Oracle, you usually notice right away. In SQL server, however, it will appear to work, and the problem only appears when someone enters an empty string instead of NULL (perhaps from a .net client library, where null is different from "", but you usually treat them the same).

I'm not saying Oracle is right, but it seems to me that both ways are approximately equally bad.


Much, much more easier to debug. Also, if you see an empty cell or input on the screen, you know the data in the DB is null. In other DBs where ''<>NULL, you can't "see" if the data is null or '', this leads to very sneaky bugs. ''=null it's the sanest option, even if it is not standard.
“In other DBs where ''<>NULL, you can't "see" if the data is null or ''” => Usually, DB tools display NULLs differently from empty strings. Actually, even Oracle SQL Developer shows NULLs as “(null)”. I guess this is to distinguish NULL from whitespace but it is unrelated to the difference between NULL and empty strings.
C
Cade Roux

Indeed, I have had nothing but difficulties in dealing with Oracle, including invalid datetime values (cannot be printed, converted or anything, just looked at with the DUMP() function) which are allowed to be inserted into the database, apparently through some buggy version of the client as a binary column! So much for protecting database integrity!

Oracle handling of NULLs links:

http://digitalbush.com/2007/10/27/oracle-9i-null-behavior/

http://jeffkemponoracle.com/2006/02/empty-string-andor-null.html


invalid datatime values? Not sure what that means. Have you posted this as a question here?
The problem pre-dated stackoverflow - I got no useful information from Oracle forums and I created a workaround - I'll track my notes down and post here.
Posted details as a question here.
y
ypercubeᵀᴹ

First of all, null and null string were not always treated as the same by Oracle. A null string is, by definition, a string containing no characters. This is not at all the same as a null. NULL is, by definition, the absence of data.

Five or six years or so ago, null string was treated differently from null by Oracle. While, like null, null string was equal to everything and different from everything (which I think is fine for null, but totally WRONG for null string), at least length(null string) would return 0, as it should since null string is a string of zero length.

Currently in Oracle, length(null) returns null which I guess is O.K., but length(null string) also returns null which is totally WRONG.

I do not understand why they decided to start treating these 2 distinct "values" the same. They mean different things and the programmer should have the capability of acting on each in different ways. The fact that they have changed their methodology tells me that they really don't have a clue as to how these values should be treated.


Citation required for making a distinction between "null string" and NULL value. In any database except Oracle, a VARCHAR field can have a value (zero or more characters) or no value (NULL), full stop.
"Five or six years ago" from 2011 would fall in the 10g timeframe (10.1 released 2003, 10.2 in 2005). 10g absolutely did not introduce any global changes in the handling of nulls, and there has never been any distinction between NULL and a null valued string, and such a distinction make no sense. I'm afraid this answer is a complete fantasy.