Home > Back-end >  550. Game Play Analysis IV - Why I cannot select from two tables?
550. Game Play Analysis IV - Why I cannot select from two tables?

Time:12-21

#LC 550. Game Play Analysis IV


Column Name Type
player_id int
device_id int
event_date date
games_played int

(player_id, event_date) is the primary key of this table. This table shows the activity of players of some games. Each row is a record of a player who logged in and played a number of games (possibly 0) before logging out on someday using some device.

Request: Write an SQL query to report the fraction of players that logged in again on the day after the day they first logged in, rounded to 2 decimal places. In other words, you need to count the number of players that logged in for at least two consecutive days starting from their first login date, then divide that number by the total number of players.

player_id device_id event_date games_played
1 2 2016-03-01 5
1 2 2016-03-02 6
2 3 2017-06-25 1
3 1 2016-03-02 0
3 4 2018-07-03 5
fraction
0.33

Below is my code:

with cte1 as (
    select a1.player_id as player_id
    from activity a1
    right join activity a2
    on dateadd(day, 1, a1.event_date) = a2.event_date
)


select round(count(distinct cte1.player_id)/count(distinct activity.player_id), 2) as fraction
from activity, cte1

The result should be 0.33 but I got 0. Probably it's because select from two tables (which can work individually). Appreciate it if someone could help me understand why it's wrong. Thank you so much!

CodePudding user response:

multiplying it by 1.0 will auto cast the count() (int) as decimal.

select round(count(distinct cte1.player_id) * 1.0/count(distinct activity.player_id) * 1.0, 2) as fraction
from activity, cte1

CodePudding user response:

You're actually not too far off with your attempt.

Your DBMS might do integer division when all operands of a division are integer. Upcast at least one of the operands, for example by multiplying with 1.0.

But additionally try a left join, that's usually easier to understand and write. You only need one join whereas you join in the CTE and in the outer SELECT.

It's also recommended to always use explicit JOIN syntax instead of commas in the FROM clause. Explicit JOIN syntax is usually easier to understand and write without errors.

SELECT round(count(DISTINCT a2.player_id) * 1.0
             /
             count(DISTINCT a1.player_id) * 1.0,
             2) AS fraction
       FROM activity a1
            LEFT JOIN activity a2
                      ON a1.player_id = a2.player_id
                         AND a1.event_date = dateadd(day, -1, a2.event_date);

db<>fiddle (assuming SQL Server)

  • Related