Question 1:
we have stores table that contain columns (Store_id,store_nm,Product)
write query to Find the stores which either sell both tea and coffee or coffee and jam.
Question 2:
we have orders table with the columns (Orderid,Orderdt,custid,Endloc)
write a sql query to return the customers who place the order within 12 days
PySpark Question 3:
we have 2 csv files,one contains department data with dept_name,dept_id columns and second csv file contains students data with studentname,stud_id , deptid , total_marks_secured , year
we need to "return the top 5 stds for each dept for each year" in the output format deptname, studid , stud_name,year,total marks
4.What are facts and dimensions in a data warehouse ?