Unique Orders and Customers Per Month

Solution Explanation:

The problem asks for the number of unique orders and unique customers per month, counting only orders whose invoice exceeds $20. The solution filters the rows first and then aggregates them by month.

Approach:

1. Filtering: Keep only the rows of the Orders table where invoice is greater than 20. The comparison is strict, so an invoice of exactly 20 is excluded.
2. Grouping: Group the remaining rows by month, extracted from the order_date column. How the month is extracted depends on the system in use (e.g., DATE_FORMAT in MySQL, .dt.to_period("M") in Pandas).
3. Aggregation: For each month, compute:
   - order_count: the number of orders, using COUNT(order_id). This equals the number of unique orders as long as order_id is unique per row; otherwise COUNT(DISTINCT order_id) would be needed.
   - customer_count: the number of unique customers, using COUNT(DISTINCT customer_id).
4. Result: Return one row per month with the columns month, order_count, and customer_count. Months with no qualifying orders do not appear in the output.

Time Complexity Analysis:

Filtering is a single linear pass over the N rows of the Orders table. Grouping and aggregation dominate the remaining cost, and the exact figure depends on how the engine implements them: hash-based grouping is roughly O(N) on average, while sort-based grouping is O(N log N). The overall time complexity is therefore O(N) in the typical hash-based case.

Space Complexity Analysis:

The output holds one row per distinct month, so it takes O(M) space, where M is the number of distinct months among the filtered orders. M is usually far smaller than N. Counting distinct customers also requires tracking the customer IDs seen within each month, which can take up to O(N) auxiliary space in the worst case.

Code in Different Languages:

MySQL:

    SELECT
        DATE_FORMAT(order_date, '%Y-%m') AS month,
        COUNT(order_id) AS order_count,
        COUNT(DISTINCT customer_id) AS customer_count
    FROM Orders
    WHERE invoice > 20
    GROUP BY 1;

This query performs the filtering, grouping, and aggregation in a single SQL statement. DATE_FORMAT reduces each order_date to its year and month, COUNT counts the orders, and COUNT(DISTINCT ...) counts the unique customers. The GROUP BY 1 clause groups by the first column of the select list (month).

Pandas (Python):

    import pandas as pd


    def unique_orders_and_customers(orders: pd.DataFrame) -> pd.DataFrame:
        # Keep only orders with invoice > 20; .copy() avoids assigning to a
        # slice of the original DataFrame (SettingWithCopyWarning).
        filtered_orders = orders[orders["invoice"] > 20].copy()
        # Label each order with its year-month, e.g. "2020-09".
        filtered_orders["month"] = (
            filtered_orders["order_date"].dt.to_period("M").astype(str)
        )
        # Count orders and distinct customers within each month.
        result = (
            filtered_orders.groupby("month")
            .agg(
                order_count=("order_id", "count"),
                customer_count=("customer_id", "nunique"),
            )
            .reset_index()
        )
        return result

The Pandas solution mirrors the SQL approach. It first filters the DataFrame, then derives the month through the .dt accessor with to_period("M"), which requires order_date to have a datetime dtype. The .groupby() method groups the rows by month, and .agg() computes the two counts with the count and nunique functions. reset_index() turns the month index back into a regular column.
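As a quick check of the steps above, the following sketch runs the Pandas approach on a small made-up table. The sample rows are invented for illustration and are not taken from the problem statement. The function follows the Pandas solution above, taking a .copy() of the filtered rows before adding the month column.

```python
import pandas as pd


def unique_orders_and_customers(orders: pd.DataFrame) -> pd.DataFrame:
    # Keep only orders whose invoice is strictly greater than 20.
    filtered = orders[orders["invoice"] > 20].copy()
    # Label each remaining order with its year-month, e.g. "2020-09".
    filtered["month"] = filtered["order_date"].dt.to_period("M").astype(str)
    # One output row per month: number of orders and distinct customers.
    return (
        filtered.groupby("month")
        .agg(
            order_count=("order_id", "count"),
            customer_count=("customer_id", "nunique"),
        )
        .reset_index()
    )


# Hypothetical sample data (assumption, for illustration only).
orders = pd.DataFrame(
    {
        "order_id": [1, 2, 3, 4, 5, 6],
        "order_date": pd.to_datetime(
            [
                "2020-09-15",
                "2020-09-17",
                "2020-10-06",
                "2020-10-13",
                "2020-10-20",
                "2020-11-05",
            ]
        ),
        "customer_id": [1, 2, 3, 3, 3, 5],
        "invoice": [30, 90, 20, 21, 45, 5],
    }
)

result = unique_orders_and_customers(orders)
print(result)
```

With this sample, September keeps two orders from two customers, while October keeps two orders that both belong to customer 3, so its customer_count is 1. Order 3 (invoice exactly 20) is dropped by the strict comparison, and November disappears entirely because its only order has an invoice of 5.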


