- September 12, 2020

ISE 529 Predictive Analytics Homework 1 Due online on September 4, 2020 The homes.xlsx data set (available from Blackboard) shows attributes from a sample of 522 residential houses. Houses price, lotsize, beds, baths, garage and year are numerical variables. Consider style, ac, pool, quality, and (proximity to a) highway as categorical variables. Categorical variables include categories only. For example, (house with a) pool, with categories yes and no. This assignment helps you practice basic data operations useful to analyze a data set. Use MS Excel to answer the following questions. The first two questions give an overview of the dataset. The other eight can be regarded as those asked by a customer to a real state agent. Please submit your report as a MS Word or pdf file onto Blackboard. Avoid screen captures in your report. 1. Find the number of categories for each categorical variable. 2. Find the smallest, median, and largest value for each numerical variable. 3. Find the most expensive house with at least three bedrooms. 4. There are houses with different styles. a) Find the number of houses from each style. b) Find the median price of houses for each style. 5. Create a new spreadsheet with all houses having two to four bedrooms with area exceeding 3000 square feet and price not less than 350,000 dollars. Include the spreadsheet in your report. 6. Create a new spreadsheet showing the least and most expensive houses by style. Include the spreadsheet in your report. 7. What is the average area and lot size of houses built after 1970. 8. What is the difference in average price of houses with two and three baths. 9. Report the smallest and largest price of houses by quality category. 10. Create a two-way table showing the number of hourses classified by style (rows) and by number of baths (columns). Row headers are the styles and column headers are the number of baths. 1