Lezárt

C# text summarize

A projektre 19 árajánlat érkezett képzett szabadúszóinktól, az átlagos ajánlat a következő volt: £25 GBP.

Kapjon ingyenes árajánlatokat egy hasonló projektre
A munkaadó dolgozik
Projekt költségvetés
£10 - £20 GBP
Összes Árajánlat
19
Projekleírás

A C# console application, that lets the user input a text file, the used then enters the percent summarization factor (numbered %) the users file is read line by line and stop words need to be removed based on their frequency and the new text needs to be saved as a new text file.

More detail:

Extractive methods - selecting important sentences, paragraphs etc. from the original

document and concatenating them into a shorter form. The importance of sentences is

decided based on statistical and linguistic features of sentences.

Application must be console based

1. Prompt the user to enter an input filename (eg [url removed, login to view]) and a percent summarization

factor, SF (where SF = summarizedWordLength x 100 / inputWordLength).

2. Read the text from [url removed, login to view] [url removed, login to view] and process the text from [url removed, login to view]

accordingly.

3. Output the summary text to file (eg [url removed, login to view]) and display to console some

appropriate statistics.

Count the occurrence of different words in the document, copying the words into a list

which is ordered by frequency with the most common word at the top of the list.

 Copy each sentence in the document into a second list.

 Remove (filter) those words from the word frequency list which are very common and

of little use in classifying the document. These are called ‘stop words’ and include

words such as such as the, is, at, which, and on. There is no definitive listing of what

defines a stop word as it could be document specific, however the file ‘[url removed, login to view]’

is provided containing a listing of generic common stop words

For each sentence, count the number of the words that matches the top word

(most frequent) in the filtered word list.

 Find the sentence that has the highest number of occurrences of the most

frequent word.

 If the length of words in the summary text added to the current selected

sentence word length exceeds the summary word length limit the sentence is

ignored.

 Else add the sentence to the summary text. Remove the word from the top of

the frequency word list and the sentence from the listing of word sentences.

}

 Output summary text to file and display appropriate statistics, including actual SF.

Szeretne pénzt keresni?

  • Adja meg a költségvetést és a munkaelvégzéséhez szükséges időt
  • Vázolja fel ajánlatát
  • Kapjon fizetést munkájáért

Alkalmazza a szabadúszókat, akik erre a projektre is ajánlatot adtak

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online