chunkster
From twext
chunkster is software that chunks text via twext method
for now, chunkster works by inserting returns
error correction needed, possibly including chunk exceptions
chances are good that chunkster will suck hard at first, but it should learn to serve better
for now we're just making first step to chunk lyrics.. here's a list of possible variables:
BEFO
i, your, to, under, on, at, of, the, you, your, you're a, so, my, is, too, i'm, i've, you'll, as, she, she's, in, don't, by, are, has
AFTE
,, then, it, with, but, there's, me,
BOTH
and, that, if, now, how, what, when, why, where, for, with, whenever, without, who, in
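a minimal sketch of how lists like these could drive the inserting of returns.. this is not the original perl code.. the function name `chunk`, the trimmed word lists, the whitespace splitting and the punctuation stripping are all assumptions made for illustration.. the sketch never puts two returns in a row:

```python
# Trimmed samples of the page's lists (assumption: single-word entries only).
BEFO = {"i", "your", "to", "under", "on", "at", "of", "the", "you", "my", "is"}
AFTE = {",", "then", "it", "but", "me"}
BOTH = {"and", "that", "if", "now", "how", "what", "when", "why", "where", "for"}


def chunk(text):
    """Insert returns before BEFO words, after AFTE words, around BOTH words."""
    chunks = [[]]  # each chunk is a list of words; the last one is still open
    for word in text.split():
        key = word.lower().strip(",.!?;:")  # compare without case or punctuation
        # Return BEFORE the word, unless a return was just added (open chunk is empty).
        if (key in BEFO or key in BOTH) and chunks[-1]:
            chunks.append([])
        chunks[-1].append(word)
        # Return AFTER the word; a trailing comma counts as the "," entry in AFTE.
        if key in AFTE or key in BOTH or (word.endswith(",") and "," in AFTE):
            chunks.append([])
    # Empty chunks are dropped, so the output never holds a blank line.
    return "\n".join(" ".join(c) for c in chunks if c)


print(chunk("hold me now"))
```

with these sample lists, "hold me now" comes out as two chunks, "hold me" and "now".. the naive rules make a lot of one-word chunks, which is where the error correction and the exceptions would come in.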
BUG
if a return was already added before a BEFO or BOTH string, then don't add a new return
CHUNKSTER for TWEXTER
chunkster software "chunks" text, sends the chunked text to http://translate.google.com where the user identifies the language pair, gets translations for each chunk, then puts the chunked text and translation into the corresponding text input fields at http://twext.cc/twexter
"chunking" text means: get text input, insert an ordered set of returns into the text, then output the text.. the returns are inserted in a pattern defined at http://twext.cc/go/method
the problem to solve is defined: make software chunk a text as per http://twext.cc/go/method.. any solution is welcome..
the problem was once solved with the following solution: user-defined text strings specify where returns are added to chunk a text.. returns are added three ways: before a text string, after a text string, or both before and after a text string
exceptions to the above rules are defined in corresponding text fields.. exceptions let users instruct chunkster software NOT to insert returns before, after, or before and after specific, excepted text strings
ancient perl code at http://twext.com/dev/chunkster/chunkster.perl.txt was once able to chunk text.. the perl script referred to text input fields where users defined text strings that get returns inserted before, after, or before-and-after.. corresponding text input fields enabled users to specify exceptions.. an illustration of the input field array is at http://twext.cc/dev/dev2006.html#CHUNKSTA
a flowchart at http://twext.com/dev/chunkster.pdf describes how the solution worked.. if you work with this solution, start at 814 and continue to 860.. refer to the text description of the program at http://twext.cc/go/814 .. simplify and adapt as needed
at this stage, we're working only with text in the form of lyrics.. later, chunkster should work with regular text in sentence and paragraph format.. for now, chunkster should output in lyric format, meaning:
repeat: above solution may suck, but it's a start.. the problem is clear: chunk text as per http://twext.cc/go/method .. your solution should plug in to a php core..
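the exception fields described above could be sketched like this.. it's one possible reading, not the perl script's logic.. the assumption is that an exception is a phrase containing a trigger string, and that no return goes before (or after) any word inside that phrase.. the names `chunk_with_exceptions`, `no_befo` and `no_afte` are hypothetical.. a BOTH string is simply listed in both `befo` and `afte`:

```python
def _keys(text):
    """Lowercased words with edge punctuation stripped, used only for matching."""
    return [w.lower().strip(",.!?;:") for w in text.split()]


def protected_positions(keys, phrases):
    """Indexes of the words that fall inside any excepted phrase."""
    hits = set()
    for phrase in phrases:
        p = _keys(phrase)
        for i in range(len(keys) - len(p) + 1):
            if keys[i:i + len(p)] == p:
                hits.update(range(i, i + len(p)))
    return hits


def chunk_with_exceptions(text, befo, afte, no_befo=(), no_afte=()):
    """Chunk text, skipping returns at positions covered by an exception."""
    words = text.split()
    keys = _keys(text)
    skip_before = protected_positions(keys, no_befo)
    skip_after = protected_positions(keys, no_afte)
    chunks = [[]]
    for i, (word, key) in enumerate(zip(words, keys)):
        # Return before a trigger, unless excepted or a return was just added.
        if key in befo and i not in skip_before and chunks[-1]:
            chunks.append([])
        chunks[-1].append(word)
        # Return after a trigger, unless excepted.
        if key in afte and i not in skip_after:
            chunks.append([])
    return "\n".join(" ".join(c) for c in chunks if c)


print(chunk_with_exceptions("i think of you", {"of", "you"}, set(),
                            no_befo=["think of"]))
```

here "think of" is excepted, so no return lands before "of" and the chunks are "i think of" and "you".. without the exception, "of" would sit alone on its own line.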


