Build a Website -- 2

$250-750 USD

Cancelled
Posted: more than 7 years ago

Paid on delivery
Hi there. I need someone to write a program that automatically captures data from this site: [login to view URL]

I do not need all the data. First look at the category list on the right side of the site. I need the data for only some of the categories. Some of them are under [恋愛・少女漫画], from [ちはやふる] to [すきっていいなよ。]. The rest of them are under [ファンタジー漫画], from [終りのセラフ] to [クレイモア].

You have to capture the data in the structure I need.

Data structure
id----order
url----URL of the article
cat----category
title----title
magazine----magazine
author----author
genre----genre
character----character
site----target website
article----the text
entry_data_at----publish time
created_at----capture time
picture----the cover for the article

*Check tip1, tip2, tip3

[cat]: the category name, [login to view URL] the category list on the right [login to view URL] you can see entries such as [ちはやふる] and [黒崎くんの言いなりになんてならない]; these are the categories.

[title]: open the category [ちはやふる] and a new page appears. There you can see entries such as [ちはやふる33巻173首のネタバレ感想] or [ちはやふる33巻172首のネタバレ感想]; these are the titles.

[article]: open one title, such as [ちはやふる33巻173首のネタバレ感想], and a new page appears with an article containing a lot of [login to view URL] You have to capture the body from the title (ちはやふる33巻173首のネタバレ感想) to the end of the article (ending just above [目次], [コメント] and the advertisements).

[entry_data_at]: the publish time of the article. For example, the publish time of ちはやふる33巻173首のネタバレ感想 is the one written under the title: 2016/10/[login to view URL] You have to record it as a timestamp, which would turn 2016/10/01 into 1451577600.

[url]: the URL of the article, like [login to view URL]

[site]: always written as [login to view URL]

[character]: for example, [login to view URL], under the advertisements, there is a [目次] [login to view URL] you can see [33巻173首] written in black and with no [login to view URL] That is the [character].

For [author], [magazine], [genre], [picture], [id] and [created_at], do the following step first: search for [cat] on [login to view URL] and use the first result.

For example, if you search for [ちはやふる] on [login to view URL], you get:
作家:末次由紀
雑誌・レーベル: BE・LOVE
ジャンル: スポーツ / 少女マンガ / アニメ化 / 映画化

[author]: the text after [作家:]. In the example the [author] is [末次由紀].

[magazine]: the text after [雑誌・レーベル:]. In the example the [magazine] is [BE・LOVE].

[genre]: the text after [ジャンル:], with the entries separated by ",". In the example the [genre] is [スポーツ,少女マンガ,アニメ化,映画化].

[picture]: the cover of the first [login to view URL] You have to capture the covers and store [login to view URL] the database there should be a [picture] column holding the URL of each cover.

[id]: the order. The first one is 1, the second one is 2, etc.

[created_at]: the time you capture the article, also recorded as a timestamp. For example, if I capture the data at UTC/GMT+08:00 2016/10/11 14:40:30, the [created_at] should be 1476168030.

Using [ちはやふる] as the example and following the steps above, you get:
id:1
url:[login to view URL]
cat:ちはやふる
title:ちはやふる33巻173首のネタバレ感想
magazine:BE・LOVE
author:末次由紀
genre:スポーツ,少女マンガ,アニメ化,映画化
character:33巻173首
site:[login to view URL]
article:<h1 class="entry-title">..........
picture: ...
entry_data_at:1451577600
created_at:1476168030

*Check the database sample photo

This is what I [login to view URL] You have to do it this way so that my server can recognize the data. The data needs to be captured every 2 hours, one [login to view URL] to send me the program you write to capture the data. Please type 1234 in your bid.
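The field rules above can be sketched in Python as follows. This is a minimal illustration, not the finished scraper. The names `Record`, `to_timestamp`, `parse_metadata` and `CLIENT_TZ` are hypothetical, and the sketch assumes that dates on the site are local to UTC+08:00, the zone used in the [created_at] example. Under that assumption 2016/10/11 14:40:30 gives 1476168030 as stated, while 2016/10/01 gives 1475251200; the sample value 1451577600 corresponds to 2016/01/01 in the same zone, so the intended publish date may need to be confirmed with the client.

```python
import re
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Assumption: dates on the site are local to UTC+08:00, the zone used in
# the created_at example (2016/10/11 14:40:30 -> 1476168030).
CLIENT_TZ = timezone(timedelta(hours=8))

# Accept both ASCII and full-width colons after each label.
META_PATTERN = re.compile(
    r"作家[::]\s*(?P<author>.+?)\s*"
    r"雑誌・レーベル[::]\s*(?P<magazine>.+?)\s*"
    r"ジャンル[::]\s*(?P<genre>.+)"
)


@dataclass
class Record:
    """One database row, using the field names from the data structure."""
    id: int
    url: str
    cat: str
    title: str
    magazine: str
    author: str
    genre: str
    character: str
    site: str
    article: str
    entry_data_at: int
    created_at: int
    picture: str


def to_timestamp(text: str, fmt: str = "%Y/%m/%d") -> int:
    """Read a date string as client-local time and return Unix seconds."""
    local = datetime.strptime(text, fmt).replace(tzinfo=CLIENT_TZ)
    return int(local.timestamp())


def parse_metadata(text: str) -> dict:
    """Extract author, magazine and comma-joined genre from the search result."""
    match = META_PATTERN.search(text)
    if match is None:
        raise ValueError("metadata labels not found")
    genres = [g.strip() for g in match.group("genre").split("/")]
    return {
        "author": match.group("author"),
        "magazine": match.group("magazine"),
        "genre": ",".join(g for g in genres if g),
    }


if __name__ == "__main__":
    sample = "作家:末次由紀 雑誌・レーベル: BE・LOVE ジャンル: スポーツ / 少女マンガ / アニメ化 / 映画化"
    print(parse_metadata(sample))
    print(to_timestamp("2016/10/11 14:40:30", "%Y/%m/%d %H:%M:%S"))
```

Fetching the pages, locating the [目次] entry and running the job every 2 hours are left out here, since they depend on the site's actual markup and on the server setup.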
Project ID: 11775055

About the project

Remote project
Active: 8 years ago

About the client

kojima, China
Rating: 5.0
Reviews: 25
Payment method verified
Member since: May 5, 2016
