Thread: mysql code questions

mysql code questions

From

Ray Stell

Date:

12 August 2009, 15:24:44

http://www.brentozar.com/archive/2009/04/getting-the-most-recent-record/
How this works?  What is ttNewer?  What is a clustered primary key in mysql? 
This is as good as I can do to get this into pg:

create table TestTable (id int not null,create_date date not null,info1 VARCHAR(50) NOT NULL,info2 VARCHAR(50) NOT
NULL,constraintPK_TestTable PRIMARY KEY(id,create_date)
 
);

INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (1, '1/1/09', 'Blue', 'Green');
INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (1, '1/2/09', 'Red', 'Yellow');
INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (1, '1/3/09', 'Orange', 'Purple');
INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (2, '1/1/09', 'Yellow', 'Blue');
INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (2, '1/5/09', 'Blue', 'Orange');
INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (3, '1/2/09', 'Green', 'Purple');
INSERT INTO TestTable (id, create_date, info1, info2)
VALUES (3, '1/8/09', 'Red', 'Blue');

select tt.*  FROM TestTable tt LEFT OUTER JOIN TestTable ttNewer       ON tt.id = ttNewer.id AND tt.create_date <
ttNewer.create_dateWHERE ttNewer.id IS NULL;
 
id | create_date | info1  | info2  
----+-------------+--------+-------- 1 | 2009-01-03  | Orange | Purple 2 | 2009-01-05  | Blue   | Orange 3 | 2009-01-08
| Red    | Blue
 
(3 rows)

Re: mysql code questions

From

Thomas Kellerer

Date:

12 August 2009, 15:52:50

Ray Stell wrote on 12.08.2009 20:19:
> http://www.brentozar.com/archive/2009/04/getting-the-most-recent-record/
> How this works?  What is ttNewer?  What is a clustered primary key in mysql? 
That article talks about SQL Server not MySQL.

> select tt.*  FROM TestTable tt
>   LEFT OUTER JOIN TestTable ttNewer    
>     ON tt.id = ttNewer.id AND tt.create_date < ttNewer.create_date
>   WHERE ttNewer.id IS NULL;

I would probably do it this way:

SELECT tt. *
FROM testtable tt
WHERE create_date = (SELECT MAX(create_date)                     FROM testtable tt2                     WHERE tt.id =
tt2.id);

Don't know which one is more efficient (with just a few rows, it doesn't really pay off to look at 
the execution plan)

Thomas

Re: mysql code questions

From

Greg Stark

Date:

12 August 2009, 16:01:13

On Wed, Aug 12, 2009 at 7:52 PM, Thomas Kellerer<spam_eater@gmx.net> wrote:
> Ray Stell wrote on 12.08.2009 20:19:
> I would probably do it this way:
>
> SELECT tt. *
> FROM testtable tt
> WHERE create_date = (SELECT MAX(create_date)
>                     FROM testtable tt2
>                     WHERE tt.id = tt2.id);

The article is *very* unclear about the problem. You have to read the
whole schema and their proposed solution to figure out what the
problem they're trying to solve is. This query doesn't solve it.

They want the newest record for *each* id. Ie, the equivalent of
"select id,max(create_date) from tt group by id" except including the
other attributes for the record with that date.

There are a number of ways to solve it. Postgres supports all of the
solutions in the comments including the one proposed in the post as
well as the window-function based solution.

It also supports an extension which could be even more efficient than
all of them:

select DISTINCT ON (id) from tt order by id,create_date desc

If you have an index on <id,create_date desc> (Or if you have an index
on <id,create_date> and use "order by id desc, create_date desc" in
the query) then this could do a single index scan.

--
greg
http://mit.edu/~gsstark/resume.pdf