Background ProRetail is a catalogue firm that sells products in a number of different catalogues that it owns. The catalogues number in the dozens, but fall into nine basic categories:
1. Clothing 2. Housewares 3. Health 4. Automotive 5. Personal electronics 6. Computers 7. Garden 8. Novelty gift 9. Jewellery
The costs of printing and distributing catalogues are high. By far the biggest cost of operation is the cost of promoting products to people who buy nothing. Having invested so much in the production of artwork and printing of catalogues, ProRetail wants to take every opportunity to use them effectively. One such opportunity is in cross-selling—once a customer has “taken the bait” and purchases one product, try to sell them another while you have their attention. Such cross-promotion might take the form of enclosing a catalogue in the shipment of a purchased product, together with a discount coupon to induce a purchase from that catalogue. Or, it might take the form of a similar coupon sent by e-mail, with a link to the web version of that catalogue. But which catalogue should be enclosed in the box or included as a link in the e-mail with the discount coupon? ProRetail would like it to be an informed choice—a catalogue that has a higher probability of inducing a purchase than simply choosing a catalogue at random.
Use A-priori algorithm to an Association rules analysis, then prepare a report on your methods and results and results for internal use within ProRetail. This should include:
a) Description of all the steps you took to prepare association rules, including the R code that you run.
b) Explanation of the meanings of the various output’s statistics (lift ratio, confidence, support) and the frequent itemsets.
c) Present the candidate rules that can be formed from the selected confidence with some form of visualisation.
d) Discussion of how the association rules analysis results could be useful to the catalogue company to make an informed choice about which catalogue/category to cross-promote to a purchaser
Your final report should be a maximum of 3,000 words, not including R code. It should include an introduction, data and methods, results, recommendation/conclusion, and, if necessary, a reference list. Your R code should be distinguished in the text using Courier New font. This is a quantitative assignment that requires data analysis with R. The dataset to be used for this assignment is a .csv file called ‘CatalogCrossSell’. It contains 10 columns and 4999 rows of data.
This assignment will account for 60% of your final course mark. Marking will be equally weighted across methods, results, and interpretation of results.
Your report will be marked according to the following criteria:
- METHODS: Clear explanation of steps taken, and methods used, correct applicationof analytical methods.• RESULTS: Explanation and presentation of results.• INTERPRETATION OF RESULTS: Evidence of critical thinking, appropriate professionalwriting and presentation style in interpretation of results.
Requirements: 3000 words