chunkster

From twext


chunkster is software that chunks text via the twext method

for now, chunkster works by inserting returns

  1. BEFO before specified text_strings
  2. AFTE after specified text_strings
  3. BOTH before and after specified text_strings
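a minimal python sketch of the three insertion rules above.. the function name `insert_returns`, its parameters, and the whitespace-based word matching are assumptions made for illustration, not taken from the original perl script.. this naive version can emit two returns in a row when an AFTE word is directly followed by a BEFO word

```python
def insert_returns(text, befo=(), afte=(), both=()):
    """Insert returns around marker words (hypothetical helper, not the original code)."""
    befo = {w.lower() for w in befo}
    afte = {w.lower() for w in afte}
    both = {w.lower() for w in both}

    tokens = []
    for word in text.split():
        key = word.lower()
        if key in befo or key in both:
            tokens.append("\n")      # return BEFORE the word
        tokens.append(word)
        if key in afte or key in both:
            tokens.append("\n")      # return AFTER the word

    # join words with single spaces, but never put a space next to a return
    out = ""
    for tok in tokens:
        if tok == "\n" or out == "" or out.endswith("\n"):
            out += tok
        else:
            out += " " + tok
    return out.strip("\n")


print(insert_returns("take my hand and walk with me",
                     befo=["my"], afte=["me"], both=["and"]))
```

matching here is by whole lowercase word only.. multi-word strings and punctuation attached to words would need extra handling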

error correction is needed, possibly including chunk exceptions (see image:ChunxtaExceptions.png)

chances are good that chunkster will suck hard at first, but it should learn to better serve

  • variable languages
  • variable users
  • variable chunk widths

for now we're just making a first step: chunking lyrics..

here's a list of possible variables:

BEFO

i, your, to, under, on, at, of, the, you, you're a, so, my, is, too, i'm, i've, you'll, as, she, she's, in, don't, by, are, has

AFTE

"," (the comma character), then, it, with, but, there's, me

BOTH

and, that, if, now, how, what, when, why, where, for, with, whenever, without, who, in




BUG: if a return has already been added at the point before a BEFO or BOTH string, don't add a new return
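one possible guard for this bug, sketched in python.. the helper name `add_return_once` and the list-of-pieces representation are assumptions for illustration

```python
def add_return_once(pieces):
    """Append a return unless the output is empty or already ends with one."""
    if pieces and pieces[-1] != "\n":
        pieces.append("\n")


# "with" is an AFTE string and "my" is a BEFO string, so both rules
# want a return at the same spot; only the first call adds one
pieces = ["walk", "with"]
add_return_once(pieces)   # return after the AFTE string
add_return_once(pieces)   # return before the BEFO string: skipped
pieces.append("my")
print(pieces)
```

routing every insertion through one guard like this keeps the BEFO, AFTE and BOTH rules from stacking returns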

CHUNKSTER for TWEXTER

chunkster software "chunks" text and sends the chunked text to http://translate.google.com, where the user identifies the language pair.. chunkster gets translations for each chunk, then puts the chunked text and translations into the corresponding text input fields at http://twext.cc/twexter

"chunking" text means get text input, insert an ordered set of returns into the text, then output the text.. the returns are inserted in a pattern defined at http://twext.cc/go/method

the problem to solve is defined: make software chunk a text as per http://twext.cc/go/method.. any solution is welcome.. the problem was once solved with the following solution:

user-defined text strings specify where returns are added to chunk a text.. returns are added three ways:

  1. BEFORE a specified text string
  2. AFTER a specified text string
  3. BEFORE AND AFTER a specified text string

exceptions to the above rules are defined in corresponding text fields.. exceptions let users instruct chunkster software NOT to insert returns before, after, or before and after specific, excepted text strings
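the exact exception semantics are not spelled out here, so the following python sketch makes an assumption: an exception is a phrase, and any marker word that falls inside an occurrence of that phrase gets no returns.. the names `excepted_positions` and `chunk_with_exceptions` are hypothetical, and only the BOTH rule is shown to keep it short

```python
def excepted_positions(words, exceptions):
    """Return the set of word indexes covered by any excepted phrase."""
    lowered = [w.lower() for w in words]
    covered = set()
    for phrase in exceptions:
        parts = phrase.lower().split()
        n = len(parts)
        if n == 0:
            continue
        for i in range(len(lowered) - n + 1):
            if lowered[i:i + n] == parts:
                covered.update(range(i, i + n))
    return covered


def chunk_with_exceptions(text, both=(), exceptions=()):
    """Apply the BOTH rule, skipping marker words inside excepted phrases."""
    both = {w.lower() for w in both}
    words = text.split()
    skip = excepted_positions(words, exceptions)

    lines, current = [], []
    for i, word in enumerate(words):
        marked = word.lower() in both and i not in skip
        if marked and current:
            lines.append(" ".join(current))   # return before the marker
            current = []
        current.append(word)
        if marked:
            lines.append(" ".join(current))   # return after the marker
            current = []
    if current:
        lines.append(" ".join(current))
    return "\n".join(lines)


print(chunk_with_exceptions("now and then i wonder and wait",
                            both=["and"], exceptions=["now and then"]))
```

here the "and" inside "now and then" is left alone, while the later "and" still gets a return on each side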

ancient perl code at http://twext.com/dev/chunkster/chunkster.perl.txt was once able to chunk text

the perl script referred to text input fields where users defined text strings that get returns inserted before, after, or before-and-after.. corresponding text input fields enabled users to specify exceptions.. an illustration of the input field array is at http://twext.cc/dev/dev2006.html#CHUNKSTA

a flowchart at http://twext.com/dev/chunkster.pdf describes how the solution worked.. if you work with this solution, start at 814 and continue to 860.. refer to text description of program at http://twext.cc/go/814 .. simplify and adapt as needed

at this stage, we're working only with text in the form of lyrics.. later, chunkster should work with regular text in sentence and paragraph format.. for now, chunkster should output in lyric format, meaning:

  • one return to specify a chunk
  • two returns to start a new line of lyrics
  • three returns to start a chorus (add empty line between lines)
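the lyric format above could be rendered like this in python.. the nested-list input shape and the name `format_lyrics` are assumptions for illustration, and the sketch reads "three returns to start a chorus" as three returns between sections

```python
def format_lyrics(sections):
    """Render nested lyrics as text.

    sections: list of sections (verse or chorus)
    each section: list of lyric lines
    each lyric line: list of chunks (strings)
    """
    rendered = []
    for section in sections:
        # one return between chunks of the same lyric line
        lines = ["\n".join(chunks) for chunks in section]
        # two returns between lyric lines
        rendered.append("\n\n".join(lines))
    # three returns between sections, e.g. before a chorus
    return "\n\n\n".join(rendered)


song = [
    [["take my hand", "and walk"], ["with me"]],   # verse
    [["la la"]],                                   # chorus
]
print(format_lyrics(song))
```

keeping chunks, lines and sections as separate levels until the final join means the return counts never have to be inferred from the text itself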

repeat: above solution may suck, but it's a start.. the problem is clear: chunk text as per http://twext.cc/go/method ..

your solution should plug in to a php core..

http://twext.cc/go/814




Retrieved from "http://twext.com/chunkster"