
## Please consider to cite our paper those results helps you, thanks:
```
================================================
FILE: preprocessing/0rating2inter.ipynb
================================================
{
"cells": [
{
"cell_type": "markdown",
"source": [
"# 从ratings_Sports_and_Outdoors.csv文件中提取U-I交互图, 5-core后重新编号\n",
"- Extracting U-I interactions and performing 5-core, re-indexing\n",
"- dataset located at: http://jmcauley.ucsd.edu/data/amazon/links.html, rating only file in \"Small\" subsets for experimentation"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {
"collapsed": true
},
"outputs": [],
"source": [
"import os, csv\n",
"import pandas as pd"
]
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"os.chdir('/home/enoche/MMRec/Sports14')\n",
"os.getcwd()"
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n",
"is_executing": true
}
}
},
{
"cell_type": "markdown",
"source": [
"## 先5-core过滤\n",
"## 5-core filtering"
],
"metadata": {
"collapsed": false
}
},
{
"cell_type": "code",
"execution_count": 3,
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"shape: (3268695, 4)\n"
]
},
{
"data": {
"text/plain": " userID itemID rating timestamp\n0 A3PMSRCL80KSA1 0000031852 4.0 1388275200\n1 A1SNLWGLFXD70K 0000031852 4.0 1392940800\n2 A1KJ4CVG87QW09 0000031852 4.0 1389657600\n3 AA9ITO6ZLZW6 0000031852 5.0 1399507200\n4 APJ5ULJ1RMZ4 0000031852 1.0 1398556800",
"text/html": "| \n | userID | \nitemID | \nrating | \ntimestamp | \n
|---|---|---|---|---|
| 0 | \nA3PMSRCL80KSA1 | \n0000031852 | \n4.0 | \n1388275200 | \n
| 1 | \nA1SNLWGLFXD70K | \n0000031852 | \n4.0 | \n1392940800 | \n
| 2 | \nA1KJ4CVG87QW09 | \n0000031852 | \n4.0 | \n1389657600 | \n
| 3 | \nAA9ITO6ZLZW6 | \n0000031852 | \n5.0 | \n1399507200 | \n
| 4 | \nAPJ5ULJ1RMZ4 | \n0000031852 | \n1.0 | \n1398556800 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \n
|---|---|---|---|---|
| 0 | \nA3PMSRCL80KSA1 | \n0000031852 | \n4.0 | \n1388275200 | \n
| 1 | \nA1SNLWGLFXD70K | \n0000031852 | \n4.0 | \n1392940800 | \n
| 2 | \nA1KJ4CVG87QW09 | \n0000031852 | \n4.0 | \n1389657600 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \n
|---|---|---|---|---|
| 564 | \nAIXZKN4ACSKI | \n1881509818 | \n5.0 | \n1390694400 | \n
| 565 | \nA1L5P841VIO02V | \n1881509818 | \n5.0 | \n1328140800 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \nx_label | \n
|---|---|---|---|---|---|
| 1 | \n1 | \n0 | \n5.0 | \n1328140800 | \n0 | \n
| 2 | \n2 | \n0 | \n4.0 | \n1330387200 | \n0 | \n
| 3 | \n3 | \n0 | \n4.0 | \n1328400000 | \n0 | \n
| 4 | \n4 | \n0 | \n4.0 | \n1366675200 | \n0 | \n
| 5 | \n5 | \n0 | \n5.0 | \n1351814400 | \n0 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \nx_label | \n
|---|---|---|---|---|---|
| 0 | \n1 | \n0 | \n5.0 | \n1328140800 | \n0 | \n
| 1 | \n2 | \n0 | \n4.0 | \n1330387200 | \n0 | \n
| 2 | \n3 | \n0 | \n4.0 | \n1328400000 | \n0 | \n
| 3 | \n4 | \n0 | \n4.0 | \n1366675200 | \n0 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \nx_label | \n
|---|---|---|---|---|---|
| 0 | \n1 | \n0 | \n5.0 | \n1328140800 | \n0 | \n
| 1 | \n2 | \n0 | \n4.0 | \n1330387200 | \n0 | \n
| 2 | \n3 | \n0 | \n4.0 | \n1328400000 | \n0 | \n
| 3 | \n4 | \n0 | \n4.0 | \n1366675200 | \n0 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \nx_label | \n
|---|---|---|---|---|---|
| 154667 | \n0 | \n11981 | \n2.0 | \n1390694400 | \n1 | \n
| 295557 | \n0 | \n15852 | \n5.0 | \n1390694400 | \n1 | \n
| 189316 | \n0 | \n17787 | \n3.0 | \n1391990400 | \n2 | \n
| 151302 | \n0 | \n0 | \n5.0 | \n1390694400 | \n1 | \n
| 1820 | \n0 | \n3369 | \n5.0 | \n1405123200 | \n2 | \n
| 60040 | \n0 | \n13372 | \n5.0 | \n1391990400 | \n2 | \n
| 199192 | \n0 | \n5458 | \n5.0 | \n1405123200 | \n2 | \n
| 163234 | \n0 | \n3327 | \n3.0 | \n1391990400 | \n2 | \n
| 60837 | \n1 | \n2322 | \n5.0 | \n1337212800 | \n0 | \n
| 233786 | \n1 | \n4123 | \n5.0 | \n1354838400 | \n0 | \n
| 163460 | \n1 | \n14212 | \n5.0 | \n1368230400 | \n0 | \n
| 206628 | \n1 | \n1542 | \n4.0 | \n1302220800 | \n0 | \n
| 261633 | \n1 | \n8802 | \n4.0 | \n1368230400 | \n0 | \n
| 99658 | \n1 | \n9198 | \n5.0 | \n1318377600 | \n0 | \n
| 268935 | \n1 | \n7215 | \n5.0 | \n1285372800 | \n0 | \n
| 77956 | \n1 | \n13468 | \n5.0 | \n1328140800 | \n0 | \n
| 105444 | \n1 | \n2374 | \n5.0 | \n1391558400 | \n1 | \n
| 237889 | \n1 | \n7169 | \n5.0 | \n1302220800 | \n0 | \n
| 173295 | \n1 | \n6677 | \n5.0 | \n1318377600 | \n0 | \n
| 50074 | \n1 | \n15278 | \n5.0 | \n1344902400 | \n0 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \nx_label | \n
|---|---|---|---|---|---|
| 154667 | \n0 | \n11981 | \n2.0 | \n1390694400 | \n0 | \n
| 295557 | \n0 | \n15852 | \n5.0 | \n1390694400 | \n0 | \n
| 189316 | \n0 | \n17787 | \n3.0 | \n1391990400 | \n0 | \n
| 151302 | \n0 | \n0 | \n5.0 | \n1390694400 | \n0 | \n
| 1820 | \n0 | \n3369 | \n5.0 | \n1405123200 | \n0 | \n
| 60040 | \n0 | \n13372 | \n5.0 | \n1391990400 | \n0 | \n
| 199192 | \n0 | \n5458 | \n5.0 | \n1405123200 | \n1 | \n
| 163234 | \n0 | \n3327 | \n3.0 | \n1391990400 | \n2 | \n
| 60837 | \n1 | \n2322 | \n5.0 | \n1337212800 | \n0 | \n
| 233786 | \n1 | \n4123 | \n5.0 | \n1354838400 | \n0 | \n
| 163460 | \n1 | \n14212 | \n5.0 | \n1368230400 | \n0 | \n
| 206628 | \n1 | \n1542 | \n4.0 | \n1302220800 | \n0 | \n
| 261633 | \n1 | \n8802 | \n4.0 | \n1368230400 | \n0 | \n
| 99658 | \n1 | \n9198 | \n5.0 | \n1318377600 | \n0 | \n
| 268935 | \n1 | \n7215 | \n5.0 | \n1285372800 | \n0 | \n
| 77956 | \n1 | \n13468 | \n5.0 | \n1328140800 | \n0 | \n
| 105444 | \n1 | \n2374 | \n5.0 | \n1391558400 | \n0 | \n
| 237889 | \n1 | \n7169 | \n5.0 | \n1302220800 | \n0 | \n
| 173295 | \n1 | \n6677 | \n5.0 | \n1318377600 | \n0 | \n
| 50074 | \n1 | \n15278 | \n5.0 | \n1344902400 | \n0 | \n
| \n | userID | \nitemID | \nrating | \ntimestamp | \nx_label | \n
|---|---|---|---|---|---|
| 0 | \n0 | \n11981 | \n2.0 | \n1390694400 | \n0 | \n
| 1 | \n0 | \n15852 | \n5.0 | \n1390694400 | \n0 | \n
| 2 | \n0 | \n17787 | \n3.0 | \n1391990400 | \n0 | \n
| 3 | \n0 | \n0 | \n5.0 | \n1390694400 | \n0 | \n
| 4 | \n0 | \n3369 | \n5.0 | \n1405123200 | \n0 | \n
| 5 | \n0 | \n13372 | \n5.0 | \n1391990400 | \n0 | \n
| 6 | \n0 | \n5458 | \n5.0 | \n1405123200 | \n1 | \n
| 7 | \n0 | \n3327 | \n3.0 | \n1391990400 | \n2 | \n
| 8 | \n1 | \n2322 | \n5.0 | \n1337212800 | \n0 | \n
| 9 | \n1 | \n4123 | \n5.0 | \n1354838400 | \n0 | \n
| 10 | \n1 | \n14212 | \n5.0 | \n1368230400 | \n0 | \n
| 11 | \n1 | \n1542 | \n4.0 | \n1302220800 | \n0 | \n
| 12 | \n1 | \n8802 | \n4.0 | \n1368230400 | \n0 | \n
| 13 | \n1 | \n9198 | \n5.0 | \n1318377600 | \n0 | \n
| 14 | \n1 | \n7215 | \n5.0 | \n1285372800 | \n0 | \n
| 15 | \n1 | \n13468 | \n5.0 | \n1328140800 | \n0 | \n
| 16 | \n1 | \n2374 | \n5.0 | \n1391558400 | \n0 | \n
| 17 | \n1 | \n7169 | \n5.0 | \n1302220800 | \n0 | \n
| 18 | \n1 | \n6677 | \n5.0 | \n1318377600 | \n0 | \n
| 19 | \n1 | \n15278 | \n5.0 | \n1344902400 | \n0 | \n
| \n | asin | \nitemID | \n
|---|---|---|
| 0 | \n1881509818 | \n0 | \n
| 1 | \n2094869245 | \n1 | \n
| 2 | \n7245456259 | \n2 | \n
| 3 | \n7245456313 | \n3 | \n
| \n | asin | \ntitle | \nprice | \nimUrl | \nrelated | \nbrand | \ncategories | \nsalesRank | \ndescription | \n
|---|---|---|---|---|---|---|---|---|---|
| 0 | \n0000032069 | \nAdult Ballet Tutu Cheetah Pink | \n7.89 | \nhttp://ecx.images-amazon.com/images/I/51EzU6qu... | \n{'also_bought': ['0000032050', 'B00D0DJAEG', '... | \nBubuBibi | \n[[Sports & Outdoors, Other Sports, Dance, Clot... | \nNaN | \nNaN | \n
| 1 | \n0000031909 | \nGirls Ballet Tutu Neon Pink | \n7.00 | \nhttp://ecx.images-amazon.com/images/I/41xBoP0F... | \n{'also_bought': ['B002BZX8Z6', 'B00JHONN1S', '... | \nUnknown | \n[[Sports & Outdoors, Other Sports, Dance]] | \n{'Toys & Games': 201847} | \nHigh quality 3 layer ballet tutu. 12 inches in... | \n
| 2 | \n0000032034 | \nAdult Ballet Tutu Yellow | \n7.87 | \nhttp://ecx.images-amazon.com/images/I/21GNUNIa... | \n{'also_bought': ['B00D2JSRFQ', '0000032042', '... | \nBubuBibi | \n[[Sports & Outdoors, Other Sports, Dance, Clot... | \nNaN | \nNaN | \n
| \n | asin | \ntitle | \nprice | \nimUrl | \nrelated | \nbrand | \ncategories | \nsalesRank | \ndescription | \nitemID | \n
|---|---|---|---|---|---|---|---|---|---|---|
| 132 | \n1881509818 | \nGhost Inc Glock Armorers Tool 3/32 Punch | \n9.99 | \nhttp://ecx.images-amazon.com/images/I/21iMxsyD... | \n{'also_bought': ['B000U3YWEM', 'B000U401J6', '... | \nGhost | \n[[Sports & Outdoors, Hunting & Fishing, Huntin... | \n{'Sports & Outdoors': 172909} | \nGhost Armorer Tool (1). The GAT is made with a... | \n0 | \n
| 155 | \n2094869245 | \n5 LED Bicycle Rear Tail Red Bike Torch Laser B... | \n8.26 | \nhttp://ecx.images-amazon.com/images/I/51RtwnJw... | \n{'also_bought': ['B0081O93N2', 'B00EYTCHJA', '... | \n\n | [[Sports & Outdoors, Cycling, Lights & Reflect... | \n{'Sports & Outdoors': 14293} | \nThis newly-designed Laser tail light can emit ... | \n1 | \n
| \n | asin | \ntitle | \nprice | \nimUrl | \nrelated | \nbrand | \ncategories | \nsalesRank | \ndescription | \nitemID | \n
|---|---|---|---|---|---|---|---|---|---|---|
| 132 | \n1881509818 | \nGhost Inc Glock Armorers Tool 3/32 Punch | \n9.99 | \nhttp://ecx.images-amazon.com/images/I/21iMxsyD... | \n{'also_bought': ['B000U3YWEM', 'B000U401J6', '... | \nGhost | \n[[Sports & Outdoors, Hunting & Fishing, Huntin... | \n{'Sports & Outdoors': 172909} | \nGhost Armorer Tool (1). The GAT is made with a... | \n0 | \n
| 155 | \n2094869245 | \n5 LED Bicycle Rear Tail Red Bike Torch Laser B... | \n8.26 | \nhttp://ecx.images-amazon.com/images/I/51RtwnJw... | \n{'also_bought': ['B0081O93N2', 'B00EYTCHJA', '... | \n\n | [[Sports & Outdoors, Cycling, Lights & Reflect... | \n{'Sports & Outdoors': 14293} | \nThis newly-designed Laser tail light can emit ... | \n1 | \n
| 201 | \n7245456259 | \nBlack Mountain Products Single Resistance Band... | \n10.49 | \nhttp://ecx.images-amazon.com/images/I/411Ikpf1... | \n{'also_bought': ['B00DDBS2JE', 'B00H1KNHE8', '... | \nBlack Mountain | \n[[Sports & Outdoors, Exercise & Fitness, Acces... | \n{'Sports & Outdoors': 1010} | \nBlack Mountain Products single resistance band... | \n2 | \n
| \n | itemID | \nasin | \ntitle | \nprice | \nimUrl | \nrelated | \nbrand | \ncategories | \nsalesRank | \ndescription | \n
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n0 | \n1881509818 | \nGhost Inc Glock Armorers Tool 3/32 Punch | \n9.99 | \nhttp://ecx.images-amazon.com/images/I/21iMxsyD... | \n{'also_bought': ['B000U3YWEM', 'B000U401J6', '... | \nGhost | \n[['Sports & Outdoors', 'Hunting & Fishing', 'H... | \n{'Sports & Outdoors': 172909} | \nGhost Armorer Tool (1). The GAT is made with a... | \n
| 1 | \n1 | \n2094869245 | \n5 LED Bicycle Rear Tail Red Bike Torch Laser B... | \n8.26 | \nhttp://ecx.images-amazon.com/images/I/51RtwnJw... | \n{'also_bought': ['B0081O93N2', 'B00EYTCHJA', '... | \nNaN | \n[['Sports & Outdoors', 'Cycling', 'Lights & Re... | \n{'Sports & Outdoors': 14293} | \nThis newly-designed Laser tail light can emit ... | \n
| 2 | \n2 | \n7245456259 | \nBlack Mountain Products Single Resistance Band... | \n10.49 | \nhttp://ecx.images-amazon.com/images/I/411Ikpf1... | \n{'also_bought': ['B00DDBS2JE', 'B00H1KNHE8', '... | \nBlack Mountain | \n[['Sports & Outdoors', 'Exercise & Fitness', '... | \n{'Sports & Outdoors': 1010} | \nBlack Mountain Products single resistance band... | \n
| 3 | \n3 | \n7245456313 | \nBlack Mountain Products Resistance Band Set wi... | \n32.99 | \nhttp://ecx.images-amazon.com/images/I/51FdHlZS... | \n{'also_bought': ['1612431712', 'B00GSBMW2Y', '... | \nBlack Mountain | \n[['Sports & Outdoors', 'Exercise & Fitness', '... | \n{'Sports & Outdoors': 15} | \n[if gte mso 9]><xml> <o:OfficeDocumentSettings... | \n
| \n | itemID | \nasin | \ntitle | \nprice | \nimUrl | \nrelated | \nbrand | \ncategories | \nsalesRank | \ndescription | \n
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n0 | \n1881509818 | \nGhost Inc Glock Armorers Tool 3/32 Punch | \n9.99 | \nhttp://ecx.images-amazon.com/images/I/21iMxsyD... | \n{'also_bought': ['B000U3YWEM', 'B000U401J6', '... | \nGhost | \n[['Sports & Outdoors', 'Hunting & Fishing', 'H... | \n{'Sports & Outdoors': 172909} | \nGhost Armorer Tool (1). The GAT is made with a... | \n
| 1 | \n1 | \n2094869245 | \n5 LED Bicycle Rear Tail Red Bike Torch Laser B... | \n8.26 | \nhttp://ecx.images-amazon.com/images/I/51RtwnJw... | \n{'also_bought': ['B0081O93N2', 'B00EYTCHJA', '... | \nNaN | \n[['Sports & Outdoors', 'Cycling', 'Lights & Re... | \n{'Sports & Outdoors': 14293} | \nThis newly-designed Laser tail light can emit ... | \n
| 2 | \n2 | \n7245456259 | \nBlack Mountain Products Single Resistance Band... | \n10.49 | \nhttp://ecx.images-amazon.com/images/I/411Ikpf1... | \n{'also_bought': ['B00DDBS2JE', 'B00H1KNHE8', '... | \nBlack Mountain | \n[['Sports & Outdoors', 'Exercise & Fitness', '... | \n{'Sports & Outdoors': 1010} | \nBlack Mountain Products single resistance band... | \n
| \n | itemID | \nasin | \ntitle | \nprice | \nimUrl | \nrelated | \nbrand | \ncategories | \nsalesRank | \ndescription | \n
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n0 | \n1881509818 | \nGhost Inc Glock Armorers Tool 3/32 Punch | \n9.99 | \nhttp://ecx.images-amazon.com/images/I/21iMxsyD... | \n{'also_bought': ['B000U3YWEM', 'B000U401J6', '... | \nGhost | \n[['Sports & Outdoors', 'Hunting & Fishing', 'H... | \n{'Sports & Outdoors': 172909} | \nGhost Armorer Tool (1). The GAT is made with a... | \n
| 1 | \n1 | \n2094869245 | \n5 LED Bicycle Rear Tail Red Bike Torch Laser B... | \n8.26 | \nhttp://ecx.images-amazon.com/images/I/51RtwnJw... | \n{'also_bought': ['B0081O93N2', 'B00EYTCHJA', '... | \n\n | [['Sports & Outdoors', 'Cycling', 'Lights & Re... | \n{'Sports & Outdoors': 14293} | \nThis newly-designed Laser tail light can emit ... | \n
| 2 | \n2 | \n7245456259 | \nBlack Mountain Products Single Resistance Band... | \n10.49 | \nhttp://ecx.images-amazon.com/images/I/411Ikpf1... | \n{'also_bought': ['B00DDBS2JE', 'B00H1KNHE8', '... | \nBlack Mountain | \n[['Sports & Outdoors', 'Exercise & Fitness', '... | \n{'Sports & Outdoors': 1010} | \nBlack Mountain Products single resistance band... | \n
| 3 | \n3 | \n7245456313 | \nBlack Mountain Products Resistance Band Set wi... | \n32.99 | \nhttp://ecx.images-amazon.com/images/I/51FdHlZS... | \n{'also_bought': ['1612431712', 'B00GSBMW2Y', '... | \nBlack Mountain | \n[['Sports & Outdoors', 'Exercise & Fitness', '... | \n{'Sports & Outdoors': 15} | \n[if gte mso 9]><xml> <o:OfficeDocumentSettings... | \n
| 4 | \n4 | \nB000002NUS | \nOuters Universal 32-Piece Blow Molded Gun Clea... | \n21.99 | \nhttp://ecx.images-amazon.com/images/I/510GjWgd... | \n{'also_bought': ['B000PW64JY', 'B0010KHNEU', '... | \nOuters | \n[['Sports & Outdoors', 'Hunting & Fishing', 'H... | \n{'Sports & Outdoors': 26738} | \nOuters now offers this rigid and durable hard ... | \n