Find Jobs
Hire Freelancers

Implement python code to generate term-docid pairs for a small startup project.

$10-30 CAD

Lezárt
Kiadva ekkor: 6 hónappal ezelőtt

$10-30 CAD

Teljesítéskor fizetve
I am looking for a freelancer who can help implement python code for my small startup project. The project involves generating term-docid pairs for a dataset of less than 1000 documents. Skills and experience required for this job include: - Intermediate level python coding expertise - Knowledge of working with small datasets - Ability to generate term-docid pairs efficiently The code does not need to be extensively commented for future reference. Sub Project 1: Use reuters, the module that while there are still more documents to be processed, accepts a document as a list of tokens (omit punctuation) and outputs term-docID pairs. Instead of appending new term-docID pairings to a list, make sure you directly append the docID to the postings list for the term. You may use a hash table. No boxes required. (a) compare timing of this SPIMI inspired procedure with the naive indexer (for 10000 term-docID pairings). (b) compile an inverted index for Reuters21578 without using any compression techniques docID hint: Use the NEWID values from the Reuters corpus to make your retrieval comparable. Subproject II: Convert your indexer into a probabilistic search engine 1. using the assumptions made about independence of terms and documents etc. and 2. using the BM25 formula 3. rank the documents your index returns and 4. for a given query, return a ranked list of results. Notes: experiment with different values for the parameters k1 and b. Test queries: 1. design four test queries: (a) a single keyword query. Compare results for the same queries of Subproject I with the results for your Nave indexer (b) a multiple keyword query for Subproject I returning documents containing all the keywords (AND) for unranked retrieval (c) a multiple keywords query returning documents containing at least one keyword (OR), where documents are ordered by how many keywords they contain) (d) a query consisting of several keywords for ranking with BM25 2. run your four test queries to showcase your code and comment on the results in your report Deliverables: well documented sample runs for your queries on the information needs: (a) Democrats’ welfare and healthcare reform policies (b) Drug company bankruptcies (c) George Bush 4. any additional testing or aborted design ideas that show off particular aspects of your project.
Projektazonosító: 37395082

A projektről

5 ajánlat
Távolról teljesíthető projekt
Aktiválva: 5 hónappal ezelőtt

Szeretne pénzt keresni?

A Freelancer oldalán történő árajánlatadás előnyei

Határozzon meg költségvetést és időkeretet
Kapja meg fizetését a munkáért
Vázolja ajánlatát
Ingyen regisztrálhat és adhat árajánlatot munkákra
5 szabadúszó adott átlagosan $24 CAD összegű árajánlatot erre a munkára
Felhasználó avatár
Hi there, I am Hamza and I am excited to hear that you are looking for someone to help implement code for your small startup project. With my extensive experience in python coding and data processing specifically related to generating term-docid pairs, I believe I am the perfect fit for this job. I have intermediate level python coding expertise as well as knowledge of working with small datasets which will be beneficial in efficiently generating term-docid pairs for your project. Additionally, I am confident that my skillset can be used to create an effective probabilistic search engine from your indexer so that you can get fast results from your searches without having to manually append new term-docid pairings to a list - this is exactly what SPIMI inspired procedure does for Reuters21578 dataset (see Subproject 1).
$50 CAD 1 napon belül
4,9 (83 értékelés)
5,6
5,6
Felhasználó avatár
Hello there, I’m a python expert and I have a great experience with python libraries that your needs for your project. I can handle this project as required. Please contact me for more details. Best Regards !
$20 CAD 2 napon belül
4,9 (18 értékelés)
4,1
4,1
Felhasználó avatár
hI,i can deliver you the work within hours as i have a strong background in python and can efficiently generate term-docid pairs for your project. My experience in python coding enables me to develop an efficient code that can be used to generate term-docid pairs for your dataset within hours. Additionally, my expertise in working with small datasets enables me to efficiently generate term-docid pairs for your project without any issues. I understand the need for collaboration and effective communication when it comes to software projects. As such, I am committed to engaging in open dialogue and actively seeking feedback from clients during the project development process. This ensures that my work aligns with the client's vision and exceeds their expectations. I look forward to hearing from you soon!
$20 CAD 2 napon belül
4,9 (11 értékelés)
3,6
3,6
Felhasználó avatár
I understand that you are looking for someone to help implement python code for your small startup project involving generating term-docid pairs for a dataset of less than 1000 documents. I have extensive experience in python coding, specifically intermediate level Python expertise, knowledge of working with small datasets and ability to generate term-docid pairs efficiently. I believe my skillset is ideal for this job as I can use my experience in software architecture, website management and other relevant areas to create an optimum solution for your needs. Additionally my expertise includes developing probabilistic search engines using the BM25 formula which can be used to rank documents returned from an indexer and for a given query return a ranked list of results. Furthermore I can provide well documented sample runs for your queries on the information needs (Democrat's welfare and healthcare reform policies Drug company bankruptcies George Bush) any additional testing or aborted design ideas that show off particular aspects of your project
$20 CAD 7 napon belül
0,0 (0 értékelés)
0,0
0,0
Felhasználó avatár
Hey, I have read your entire job description and am confident that I can do this job of yours. I have a lot of experience in this field. I am an expert in Python, SQL and Software Architecture. I can complete this job of yours in a cost-effective and timely manner. Please feel free to contact me for this project
$11 CAD 1 napon belül
0,0 (0 értékelés)
0,0
0,0

Az ügyfélről

CANADA zászlója
Montreal, Canada
4,8
1
Fizetési mód hitelesítve
Tagság kezdete: aug. 27, 2022

Ügyfél-hitelesítés

Köszönjük! E-mailben elküldtük a linket, melyen átveheti ajándék egyenlegét.
E-mailje elküldése során valami hiba történt. Kérjük, próbálja újra.
Regisztrált Felhasználók Összes Közzétett Munka
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Előnézet betöltése
Hozzáférést adott a helymeghatározáshoz.
Belépési munkamenete lejárt, és kijelentkeztettük. Kérjük, lépjen be újra.