PRODUCT CRAWLER
Manual
ALTILIA S.r.l.
Operational headquarters: Piazza Vermicelli – 87036 Rende (CS), Italy
Registered office: Via A. Volta, 41 – 87036 Rende (CS), Italy
Tel & Fax: +39 0984 494277
E-mail: [email protected]
Web: www.altilagroup.com
Used by permission. All rights reserved.
VAT no. and tax code: 10780921002, Cosenza Companies Register
R.E.A. Cosenza no. 209625 - Paid-up share capital € 17,395.56
22/04/2014
TABLE OF CONTENTS
1 APP documentation
2 API documentation
2.1 How it works
2.2 How to use it
1 APP DOCUMENTATION
The Product Crawler API service extracts all products from the web pages reached by
navigating site menus, based on keywords chosen by the user. The service returns the
extracted records as a JSON object.
The Product Crawler GUI is not available at the moment.
You can use the service through the Product Crawler API (see the next section).
2 API DOCUMENTATION
The Product Crawler API service extracts all products from the web pages reached by
navigating site menus, based on keywords chosen by the user. The service returns the
extracted records as a JSON object.
2.1 HOW IT WORKS
To consume the Product Crawler API, you only need to specify the URL of an e-commerce web
site and one or more menu names separated by commas.
In the following example we test the service on the walmart.com web site and request all
products in the “desktops” category; the service returns the results shown below, in both
visual and JSON format:
Figure 1 - example of input data
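As a rough sketch of the input in this example, the request body could be assembled as shown below. The field names url, keyWords and accessToken are the arguments listed in section 2.2. The helper name build_payload, the exact URL string, the placeholder token and the choice of a comma with no surrounding spaces as separator are illustrative assumptions, not part of the service specification.

```python
import json


def build_payload(url, menu_names, access_token):
    """Return the JSON request body as a string (hypothetical helper)."""
    # Trim whitespace and drop empty entries before joining.
    cleaned = [name.strip() for name in menu_names if name.strip()]
    if not cleaned:
        raise ValueError("at least one menu name is required")
    return json.dumps({
        "url": url,
        # Menu names are sent as one comma-separated string (assumed format).
        "keyWords": ",".join(cleaned),
        "accessToken": access_token,
    })


# Placeholder values: the walmart.com "desktops" example from the text.
body = build_payload("http://www.walmart.com", ["desktops"], "your access token")
print(body)
```

Passing several menu names, for example ["desktops", "laptops"], would yield a single keyWords string with the names separated by commas.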
Figure 2 - example of visual output data
The following is the JSON output format:
{
  "categories":[
    {
      "categoryName":"<h1><b>Desktops</b></h1>",
      "category":[
        {
          "record":"<!-- TEST 1--><!-99752068-06c5-4d70-bb25-816bc8a693fb
2a426753-4817-4217-aa2c-3146b7f4ba7b-->
--><!--
<div class="\"quickLook\"">
<div
id="\"ilm_24766511\""
style="0display:
none;"
onclick="\"WALMART.quicklook.LoadQuickView('24766511')\"">
name="\"modal\""
<div class="\"BlueBtn\"" id="\"img_ilm_24766511\"" div="">
<p class="\"clear\""><!-- --></p>
"
        },
        {
          "record":"<!-- TEST 2--> …. and so on …
        }
      ]
    }
  ]
}
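A minimal sketch of how a client might read output of this shape is given below. It assumes the output is valid JSON with a top-level "categories" list whose items carry "categoryName" and a "category" list of "record" objects, as in the listing above. The function name summarize and the sample records are placeholders made up for illustration; real records contain the raw HTML of each product.

```python
import json
import re

# Matches anything that looks like an HTML tag, e.g. <h1> or </b>.
TAG_RE = re.compile(r"<[^>]+>")


def summarize(output_text):
    """Map each category name (HTML tags stripped) to its list of raw records."""
    data = json.loads(output_text)
    summary = {}
    for cat in data.get("categories", []):
        name = TAG_RE.sub("", cat.get("categoryName", "")).strip()
        records = [item["record"] for item in cat.get("category", []) if "record" in item]
        summary[name] = records
    return summary


# Placeholder sample in the documented shape (not real service output).
sample = json.dumps({
    "categories": [{
        "categoryName": "\n<h1><b>Desktops</b></h1>\n",
        "category": [{"record": "<div>first</div>"}, {"record": "<div>second</div>"}],
    }]
})
print(summarize(sample))
```

The records are left as raw HTML strings here; extracting product fields from them would need an HTML parser and depends on the crawled site.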
2.2 HOW TO USE IT
To consume the Product Crawler API, you only need to send an HTTP POST request to the
following endpoint:
http://mantra13dev-prod.apigee.net/mantra-services/product-crawler
sending the following arguments in a JSON message:
Argument      Description
url           The URL of an e-commerce web page.
keyWords      Set of menu names/menu entries to browse.
accessToken   Access token.
The following example illustrates the service request and response in JSON format:
Request
{
"url":"the URL of the e-commerce web page to crawl",
"keyWords":"menu entries/menu names to browse",
"accessToken":"your access token"
}
Note: set the Content-Type header to 'application/json'.
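The request could be issued from Python roughly as follows, using only the standard library. This is a sketch under assumptions: the endpoint is the one printed in this section (the actual domain name and port are provided with the service contract), and the function names make_request and call_service, the 60-second timeout and the placeholder argument values are illustrative rather than prescribed by the service.

```python
import json
import urllib.request

# Endpoint as printed in this manual; the real host is supplied with the contract.
ENDPOINT = "http://mantra13dev-prod.apigee.net/mantra-services/product-crawler"


def make_request(url, key_words, access_token, endpoint=ENDPOINT):
    """Build (but do not send) the POST request with a JSON body."""
    body = json.dumps({
        "url": url,
        "keyWords": key_words,
        "accessToken": access_token,
    }).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def call_service(request, timeout=60):
    """Send the request and decode the JSON response (needs network access)."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Placeholder values; replace them with a real URL, menu names and token.
req = make_request("http://www.walmart.com", "desktops", "your access token")
print(req.get_method(), req.full_url)
```

Calling call_service(req) would send the request. It is kept separate so that the request can be inspected before any network traffic takes place.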
Response
{"categories":
[{"category":
[{"categoryName":""},
{"record":""},
{"record":""},
.
.
.
{"record":""}
]},
{"category":
[{"categoryName":""},
{"record":""},
{"record":""},
.
.
.
{"record":""}
]}
]
}
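In the response skeleton above, the category name appears as the first item of each "category" list rather than next to it. A client reading that layout might separate names from records as sketched below. The function names split_category and read_response and the sample values are hypothetical, and the sketch assumes the response has already been decoded from JSON into Python objects.

```python
def split_category(entries):
    """Separate the categoryName item from the record items of one "category" list."""
    name = None
    records = []
    for entry in entries:
        if "categoryName" in entry:
            name = entry["categoryName"]
        elif "record" in entry:
            records.append(entry["record"])
    return name, records


def read_response(response):
    """Return a list of (categoryName, records) pairs from a decoded response."""
    return [
        split_category(group.get("category", []))
        for group in response.get("categories", [])
    ]


# Placeholder sample following the skeleton (not real service output).
sample = {
    "categories": [
        {"category": [
            {"categoryName": "Desktops"},
            {"record": "<div>first</div>"},
            {"record": "<div>second</div>"},
        ]},
    ]
}
for name, records in read_response(sample):
    print(name, len(records))
```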
You can view the WADL at the following address:
http://mantra13dev-prod.apigee.net/mantra-services/product-crawler
* Note: the domain name and port values will be provided with the service contract.
For more information about how to use the Product Crawler API in your application, send us a
message at [email protected].