Queries Quality and Percentage

Table: Queries +-------------+---------+ | Column Name | Type | +-------------+---------+ | query_name | varchar | | result | varchar | | position | int | | rating | int | +-------------+---------+ This table may have duplicate rows. This table contains information collected from some queries on a database. The position column has a value from 1 to 500 . The rating column has a value from 1 to 5 . Query with rating less than 3 is a poor query. We define query quality as: The average of the ratio between query rating and its position. We also define poor query percentage as: The percentage of all queries with rating less than 3. Write a solution to find each query_name, the quality and poor_query_percentage . Both quality and poor_query_percentage should be rounded to 2 decimal places . Return the result table in any order . The result format is in the following example. Example 1: Input: Queries table: +------------+-------------------+----------+--------+ | query_name | result | position | rating | +------------+-------------------+----------+--------+ | Dog | Golden Retriever | 1 | 5 | | Dog | German Shepherd | 2 | 5 | | Dog | Mule | 200 | 1 | | Cat | Shirazi | 5 | 2 | | Cat | Siamese | 3 | 3 | | Cat | Sphynx | 7 | 4 | +------------+-------------------+----------+--------+ Output: +------------+---------+-----------------------+ | query_name | quality | poor_query_percentage | +------------+---------+-----------------------+ | Dog | 2.50 | 33.33 | | Cat | 0.66 | 33.33 | +------------+---------+-----------------------+ Explanation: Dog queries quality is ((5 / 1) + (5 / 2) + (1 / 200)) / 3 = 2.50 Dog queries poor_ query_percentage is (1 / 3) * 100 = 33.33 Cat queries quality equals ((2 / 5) + (3 / 3) + (4 / 7)) / 3 = 0.66 Cat queries poor_ query_percentage is (1 / 3) * 100 = 33.33

Solution Explanation: This problem requires calculating two metrics for each query name in the Queries table: quality and poor_query_percentage. 1. Quality Calculation: The quality is defined as the average of the ratio between the query's rating and its position. For each query, we need to: Calculate the ratio rating / position for each row. Average these ratios across all rows with the same query_name. 2. Poor Query Percentage Calculation: The poor_query_percentage represents the percentage of queries with a rating less than 3 for each query_name. We need to: Count the number of queries with rating < 3 for each query_name. Divide this count by the total number of queries for that query_name. Multiply the result by 100 to get the percentage. SQL Solution: The SQL solution leverages the power of aggregate functions like AVG, SUM, COUNT, and ROUND. The GROUP BY clause is crucial for performing these calculations separately for each query_name. MySQL Code: The provided MySQL code efficiently computes both metrics. Let's break it down: SELECT query_name, ROUND(AVG(rating / position), 2) AS quality, ROUND(AVG(rating < 3) * 100, 2) AS poor_query_percentage FROM Queries WHERE query_name IS NOT NULL GROUP BY 1;:root {--copy-icon: url("data:image/svg+xml,%3Csvg xmlns='http://www.w3.org/2000/svg' viewBox='0 0 48 48'%3E%3Cpath fill='%23adadad' d='M16.187 9.5H12.25a1.75 1.75 0 0 0-1.75 1.75v28.5c0 .967.784 1.75 1.75 1.75h23.5a1.75 1.75 0 0 0 1.75-1.75v-28.5a1.75 1.75 0 0 0-1.75-1.75h-3.937a4.25 4.25 0 0 1-4.063 3h-7.5a4.25 4.25 0 0 1-4.063-3M31.813 7h3.937A4.25 4.25 0 0 1 40 11.25v28.5A4.25 4.25 0 0 1 35.75 44h-23.5A4.25 4.25 0 0 1 8 39.75v-28.5A4.25 4.25 0 0 1 12.25 7h3.937a4.25 4.25 0 0 1 4.063-3h7.5a4.25 4.25 0 0 1 4.063 3M18.5 8.25c0 .966.784 1.75 1.75 1.75h7.5a1.75 1.75 0 1 0 0-3.5h-7.5a1.75 1.75 0 0 0-1.75 1.75'/%3E%3C/svg%3E");--success-icon: url("data:image/svg+xml,%3Csvg xmlns='http://www.w3.org/2000/svg' viewBox='0 0 24 24'%3E%3Cpath fill='%2366ff85' d='M9 16.17L5.53 12.7a.996.996 0 1 0-1.41 1.41l4.18 4.18c.39.39 1.02.39 1.41 0L20.29 7.71a.996.996 0 1 0-1.41-1.41z'/%3E%3C/svg%3E");}pre:has(code) {position: relative;}pre button.rehype-pretty-copy {right: 1px;padding: 0;width: 24px;height: 24px;display: flex;margin-top: 2px;margin-right: 8px;position: absolute;border-radius: 25%;backdrop-filter: blur(3px);& span {width: 100%;aspect-ratio: 1 / 1;}& .ready {background-image: var(--copy-icon);}& .success {display: none; background-image: var(--success-icon);}}&.rehype-pretty-copied {& .success {display: block;} & .ready {display: none;}}pre button.rehype-pretty-copy.rehype-pretty-copied {opacity: 1;& .ready { display: none; }& .success { display: block; }} SELECT query_name, ...: This selects the query_name and the calculated metrics. ROUND(AVG(rating / position), 2): This calculates the average of rating / position for each query_name and rounds it to two decimal places. The AVG function computes the average of the ratios. ROUND(AVG(rating < 3) * 100, 2): This is a clever way to compute the percentage of poor queries. The expression rating < 3 evaluates to 1 (true) or 0 (false). AVG then computes the average of these 1s and 0s, which represents the fraction of poor queries. Multiplying by 100 gives the percentage, and ROUND rounds to two decimal places. FROM Queries: This specifies the table to query. WHERE query_name IS NOT NULL: This filters out any rows where query_name is null, ensuring accurate calculations. GROUP BY 1: This groups the results by query_name, so the aggregations are performed for each unique query name (1 refers to the first column in the SELECT statement). Time Complexity Analysis: The time complexity of this SQL query is dominated by the GROUP BY operation. In the worst case, if there are n rows in the Queries table, the GROUP BY operation will take O(n) time to group the rows based on query_name. The aggregate functions (AVG, SUM, COUNT) then take linear time proportional to the number of rows within each group. Therefore, the overall time complexity is O(n), where n is the number of rows in the Queries table. The sorting involved in GROUP BY is typically done using efficient algorithms, making the overall time complexity linear on average. Space Complexity Analysis: The space complexity depends primarily on the size of the output. In the worst case, if all query_name values are unique, the output will have a size proportional to the number of unique query names in the table. The intermediate space required for the aggregation process also depends on the size of the input, but it is generally considered linear with respect to input size. Hence, the space complexity is O(m) where m is the number of unique query_name values. In the worst-case scenario where every row has a unique query_name, this simplifies to O(n).

Also Explore

DSA Questions

Sort Items by Groups Respecting Dependencies

DSA Questions

Last Person to Fit in the Bus

DSA Questions

Monthly Transactions II

DSA Questions

Design Skiplist

DSA Questions

Unique Number of Occurrences

DSA Questions

Get Equal Substrings Within Budget

DSA Questions

Remove All Adjacent Duplicates in String II

DSA Questions

Minimum Moves to Reach Target with Rotations

DSA Questions

Queries Quality and Percentage

DSA Questions

Team Scores in Football Tournament

DSA Questions

Intersection of Three Sorted Arrays

DSA Questions

Two Sum BSTs

DSA Questions

Stepping Numbers

DSA Questions

Valid Palindrome III

DSA Questions

Minimum Cost to Move Chips to The Same Position

DSA Questions

Longest Arithmetic Subsequence of Given Difference

DSA Questions

Queries Quality and Percentage

Solution Explanation:

On This Page

Also Explore

Sort Items by Groups Respecting Dependencies

Last Person to Fit in the Bus

Monthly Transactions II

Design Skiplist

Unique Number of Occurrences

Get Equal Substrings Within Budget

Remove All Adjacent Duplicates in String II

Minimum Moves to Reach Target with Rotations

Queries Quality and Percentage

Team Scores in Football Tournament

Intersection of Three Sorted Arrays

Two Sum BSTs

Stepping Numbers

Valid Palindrome III

Minimum Cost to Move Chips to The Same Position

Longest Arithmetic Subsequence of Given Difference

Path with Maximum Gold