Get Highest Answer Rate Question

Solution Explanation for LeetCode 578: Get Highest Answer Rate Question

This problem requires finding the question with the highest answer rate in a `SurveyLog` table. The answer rate is the number of times a question was answered divided by the number of times it was shown. If multiple questions share the highest rate, the one with the smallest `question_id` should be returned. We'll analyze two SQL solutions.

Solution 1: Using `SUM()` and `GROUP BY`

This solution computes the answer rate for each question with one aggregation and then selects the question with the highest rate.

MySQL Code:

    SELECT question_id AS survey_log
    FROM SurveyLog
    GROUP BY 1
    ORDER BY SUM(action = 'answer') / SUM(action = 'show') DESC, 1
    LIMIT 1;

Explanation:

`SELECT question_id AS survey_log`: Selects `question_id` and renames it to `survey_log`, as required by the problem statement.

`FROM SurveyLog`: Specifies the table to query.

`GROUP BY 1`: Groups the rows by `question_id` (the first column in the `SELECT` list), so that the sums below are computed per question.

`SUM(action = 'answer')`: Counts the rows in each group whose action is 'answer'. MySQL evaluates a comparison to an integer (1 for true, 0 for false), so summing the comparison counts the matching rows.

`SUM(action = 'show')`: Similarly, counts the rows in each group whose action is 'show'.

`SUM(action = 'answer') / SUM(action = 'show')`: Computes the answer rate for each question by dividing the number of answers by the number of times the question was shown. If a question was never shown, the denominator is 0; in a MySQL `SELECT` this division yields `NULL` rather than failing the query, and `NULL` sorts last in descending order. The problem statement implies that every question is shown at least once, so this case should not arise.

`ORDER BY SUM(action = 'answer') / SUM(action = 'show') DESC, 1`: Orders the results by answer rate in descending order. The `, 1` is a secondary sort key: if two questions have the same answer rate, they are ordered by `question_id` ascending (the default), so the smaller `question_id` comes first, as the problem requires.

`LIMIT 1`: Keeps only the top row, which is the question with the highest answer rate (or the smallest `question_id` among those tied for it).

Time Complexity: Dominated by the `GROUP BY` operation, which is typically O(N log N) when the engine sorts and O(N) when it hashes or reads an index in order, where N is the number of rows in the `SurveyLog` table.

Space Complexity: O(M), where M is the number of distinct `question_id` values, because one intermediate result is kept per question.

Solution 2: Using Window Functions (MySQL)

This solution computes the same ratio with window functions instead of `GROUP BY`. Window functions and CTEs require MySQL 8.0 or later.

MySQL Code:

    WITH T AS (
        SELECT
            question_id AS survey_log,
            (SUM(action = 'answer') OVER (PARTITION BY question_id))
                / (SUM(action = 'show') OVER (PARTITION BY question_id)) AS ratio
        FROM SurveyLog
    )
    SELECT survey_log
    FROM T
    ORDER BY ratio DESC, 1
    LIMIT 1;

Explanation:

`WITH T AS (...)`: Defines a Common Table Expression (CTE) named `T`. CTEs are useful for breaking a query into smaller, more manageable parts.

`SELECT question_id AS survey_log, ...`: As in Solution 1, selects `question_id` under the name `survey_log`.

`(SUM(action = 'answer') OVER (PARTITION BY question_id))`: Uses a window function to count the 'answer' actions for each `question_id`. `PARTITION BY question_id` ensures that the sum is computed separately for each question.

`(SUM(action = 'show') OVER (PARTITION BY question_id))`: Does the same for 'show' actions.

`... / ... AS ratio`: Computes the answer rate and names it `ratio`. Unlike `GROUP BY`, a window function does not collapse rows, so `T` contains one row per row of `SurveyLog`, and every row of a given question carries the same `ratio`.

The rest of the query (`SELECT survey_log FROM T ...`): Selects `survey_log` from `T`, orders by `ratio` descending and then by `survey_log` (the `question_id`) ascending, and limits the result to one row. The duplicate rows in `T` do not affect the answer because only the first row is returned.

Time Complexity: Similar to Solution 1, likely O(N log N) or O(N), dominated by partitioning the rows for the window functions and by the final sort.

Space Complexity: O(N) if the CTE is materialized, because `T` holds one row per log row rather than one row per question.

In summary, both solutions solve the problem. Solution 1 is simpler, works without window-function support, and orders only one row per question. Solution 2 shows how the same ratio can be expressed with window functions, but it orders one row per log entry, so it is unlikely to be faster. The choice depends on personal preference and on what the database system supports.
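To try the ranking logic of Solution 1 on a small table, the sketch below runs a portable variant of the query through Python's built-in `sqlite3` module. Several things here are assumptions rather than part of the original solution: SQLite stands in for MySQL, the sample rows and the helper name `highest_answer_rate` are made up for illustration, `CASE WHEN` replaces MySQL's comparison-as-integer shortcut, `1.0` forces real division (SQLite would otherwise divide integers), and `NULLIF` makes the never-shown case explicit.

```python
import sqlite3

# Hypothetical sample rows (question_id, action); not from the problem statement.
ROWS = [
    (285, "show"), (285, "answer"),
    (369, "show"), (369, "skip"),
    (120, "show"), (120, "answer"),
    (120, "show"), (120, "answer"),
]

# Portable variant of Solution 1 (assumption: SQLite instead of MySQL).
# NULLIF turns a zero "show" count into NULL, which sorts last under DESC.
QUERY = """
SELECT question_id AS survey_log
FROM SurveyLog
GROUP BY question_id
ORDER BY
    SUM(CASE WHEN action = 'answer' THEN 1.0 ELSE 0 END)
      / NULLIF(SUM(CASE WHEN action = 'show' THEN 1 ELSE 0 END), 0) DESC,
    question_id
LIMIT 1
"""


def highest_answer_rate(rows):
    """Load rows into an in-memory SurveyLog table and run the query."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE SurveyLog (question_id INTEGER, action TEXT)")
    conn.executemany("INSERT INTO SurveyLog VALUES (?, ?)", rows)
    result = conn.execute(QUERY).fetchone()
    conn.close()
    return result[0] if result else None


# Questions 120 and 285 both have rate 1.0; the smaller id wins the tie.
print(highest_answer_rate(ROWS))
```

With these rows, questions 120 and 285 tie at a rate of 1.0 and question 369 has a rate of 0, so the secondary sort key decides the result.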
