lyric chunkster
From twext
|
LYRIC CHUNKSTER FOR TWEXTER IN PHP
this should be a simple job for a developer with solid experience in PHP, maybe AJAX and definitely OPEN SOURCE collaborative software development.
MILESTONES:
background
the chunkster software:
"chunking" text means: get text input, insert an ordered set of returns into the text, then output the text.. the returns are inserted in a pattern, as per the twext method..
lyric chunking is much simpler than text chunking.. we don't have to solve the text chunking problem now.. we do have to solve the lyric chunking problem now.. solve the problem how you want..
know that even the text chunking problem was once sorta solved with this solution: user-defined text strings specify where returns are added to chunk a text.. returns are added three ways: before a string, after a string, or both before and after a string.
EXCEPTIONS to the above rules were defined in related text fields.. exceptions let users instruct the chunkster software NOT to insert returns before, after, or before and after specified, excepted text strings..
ancient perl code at http://twext.com/dev/chunkster/chunkster.perl.txt once chunked text.. sorta.. the perl script referred to user-modifiable textarea input fields like those illustrated here: http://twext.cc/dev/dev2006.html#CHUNKSTA
a flowchart at http://twext.com/dev/chunkster.pdf and text at http://twext.cc/go/814 describe how the solution worked.. this problem is much simpler when focus narrows to simply CHUNK LYRICS..
CHUNK LYRICS
here is an example of a simple lyric text that lyric chunkster must chunk:
-------------------------------+
this line of lyrics to a song,
another line, not as long
skip a line to start a chorus
-------------------------------+
1. CHUNK LINES AND CHORUS
-------------------------------+
this line of lyrics to a song,
another line, not as long
skip a line to start a chorus
-------------------------------+
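A minimal sketch of step 1, written in Python for illustration rather than the PHP5 the job calls for. It assumes that a skipped (blank) line in the input marks the start of a chorus or new stanza, as the example lyric suggests. The function name `split_stanzas` is hypothetical and not part of any twexter code.

```python
import re


def split_stanzas(lyrics):
    """Split lyrics into stanzas; each stanza is a list of non-empty lines.

    Assumption: one or more blank lines separate a verse from a chorus.
    """
    # Normalize Windows and old Mac line endings to plain "\n".
    text = lyrics.replace("\r\n", "\n").replace("\r", "\n")
    stanzas = []
    # A run of blank (or whitespace-only) lines ends the current stanza.
    for block in re.split(r"\n\s*\n", text.strip()):
        lines = [line.strip() for line in block.split("\n") if line.strip()]
        if lines:
            stanzas.append(lines)
    return stanzas


example = (
    "this line of lyrics to a song,\n"
    "another line, not as long\n"
    "\n"
    "skip a line to start a chorus\n"
)

for stanza in split_stanzas(example):
    print(stanza)
```

Keeping stanzas as separate lists lets the later steps work on one lyric line at a time, so a blank line that marks a chorus is never confused with a return added by the chunker.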
2. CHUNK WITHIN LINES
refer to chunk criteria inputs as demonstrated at: http://twext.cc/dev/dev2006.html#CHUNKSTER
if the BEFO STRINGS include "to" and "a", the AFTE STRINGS include ",", and the BOTH STRINGS include "this", then the software adds returns to produce this result:
-------------------------------+
this
line of lyrics
to
a song,
another line,
not as long
skip
a line
to start
a chorus
-------------------------------+
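One way step 2 could work, sketched in Python for illustration (the job itself asks for PHP5). The matching rules are assumptions, not taken from the old perl script: word strings such as "to" are compared against whole whitespace-separated tokens with edge punctuation removed, while punctuation-only strings such as "," match when a token ends with them. The names `chunk_line` and `_hits` are hypothetical.

```python
import string

# Chunk criteria from the example above.
BEFO = {"to", "a"}   # return goes before these strings
AFTE = {","}         # return goes after these strings
BOTH = {"this"}      # return goes before and after these strings


def _hits(token, strings):
    """True if the token matches any chunk string (assumed matching rules)."""
    low = token.lower()
    bare = low.strip(string.punctuation)  # "song," -> "song"
    for s in strings:
        if any(c.isalnum() for c in s):
            # Word string: must equal the whole token, ignoring edge punctuation.
            if bare == s:
                return True
        elif low.endswith(s):
            # Punctuation-only string: matches at the end of the token.
            return True
    return False


def chunk_line(line, befo=BEFO, afte=AFTE, both=BOTH):
    """Split one lyric line into chunks; each chunk becomes its own output line."""
    chunks, current = [], []
    for token in line.split():
        # Break before the token, unless nothing has been collected yet.
        if (_hits(token, befo) or _hits(token, both)) and current:
            chunks.append(" ".join(current))
            current = []
        current.append(token)
        # Break after the token.
        if _hits(token, afte) or _hits(token, both):
            chunks.append(" ".join(current))
            current = []
    if current:
        chunks.append(" ".join(current))
    return chunks


for lyric in ("this line of lyrics to a song,",
              "another line, not as long",
              "skip a line to start a chorus"):
    print("\n".join(chunk_line(lyric)))
```

Because a chunk is only emitted when it holds at least one token, this sketch never produces an empty chunk at the start or end of a line. A version that inserts returns directly into the text would still need a separate cleanup pass.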
3. CLEANUP
if a line starts with a BOTH STRING, then delete the return added before the string in step 2..
if an AFTE STRING ends a line, then delete the return added after the string in step 2..
after step three, we should get this result:
-------------------------------+
this
line of lyrics
to
a song,
another line,
not as long
skip
a line
to start
a chorus
-------------------------------+
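The cleanup rules can be sketched as follows, in Python for illustration rather than PHP5. The sketch assumes a naive step 2 that inserts a return at every match, which leaves an empty piece before a BOTH STRING that starts a line and after an AFTE STRING that ends one. The `raw` string is a hand-written example of such output, and `cleanup_line` is a hypothetical name.

```python
def cleanup_line(raw_line):
    """Remove the empty pieces left by returns added at the edges of a line."""
    pieces = (piece.strip() for piece in raw_line.split("\n"))
    return [piece for piece in pieces if piece]


# Assumed raw output of a naive step 2 for the first lyric line:
# a return before and after "this", before "to", before "a", and after ",".
raw = "\nthis\n line of lyrics \nto \na song,\n"

print("\n".join(cleanup_line(raw)))
```

Running the cleanup one lyric line at a time, rather than over the whole text, keeps a blank line that marks a chorus from being deleted along with the doubled returns.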
the result is text chunked as per twext method, so please include some wiggle room in your bid here.. the main trick is to find and fix double chunk errors..
we focus on chunking one language, english, for now.. meaning only one set of BEFO, AFTE, BOTH fields.. http://twext.cc/dev/dev2006.html#CHUNKSTER includes fields for exceptions to chunk criteria.. you should anticipate this function but we DO NOT NEED TO MAKE EXCEPTIONS AT THIS STAGE..
our purpose is to create and test a very simple LYRIC CHUNKSTER written in PHP5.. we will test your lyric chunkster with a wide variety of lyrics, poems etc.. obviously, the above solution is not complete, so we'll probably run through a few testing cycles.. the problem, however, is clearly defined, narrow and simple, so this is unlikely to be a huge challenge for us..
if we DO need to add a layer of complexity to make lyric chunkster work, such as including fields to manage exceptions to BEFO/AFTE/BOTH chunk strings, then that will be extra work and extra pay for you..
MACHINE TRANSLATE
GOOGLE TRANSLATE THE LYRIC CHUNKS
when we are happy with lyric chunkster function, we will connect it to an http://translate.google.com interface.. ideally, we have an intermediary interface that integrates with http://twext.cc/twexter
thus, a user will
then, the software will
user, if translator, can make any corrections needed, and save the file on their machine..
later we'll worry about titles, storing and finding data, users, managing languages and all that stuff.. for now, we focus very simply on CHUNKING LYRICS and GETTING TRANSLATIONS and the PLUGIN to twexter..
COMPLETE
completion of work must include: plugin interface to core; clear English code comments; pseudocode; author name and email contact; copyright info; GPLv2 license; and source code shared at http://sf.net/projects/twexter
PLUGIN TO PHP TWEXTER
http://twext.cc/go/plugin describes a PHP architecture being developed to include/exclude functions like LYRIC CHUNKSTER into twexter software builds.. this should be pretty easy to coordinate..
TEST.TWEXT.COM
plugin core connects lyric chunkster with http://twext.cc/twexter function.. we test complete system installable as needed at http://test.twext.cc
CLEAR COMMENTS IN ENGLISH
your core code must be explicitly commented and explained so other developers can easily participate..
EXPLICIT PROGRAM LOGIC
this is freely licensed open software.. parallel systems may emerge in python, ruby, scheme, etc.. PSEUDO CODE, UML, FLOWCHART or likewise explicit description of the program logic is required to help others understand, use and extend your work..
SHARING AND ATTRIBUTION
please include your name and email so developers can know who did the work and contact you if need be.. successful execution of this work may also serve to promote your services.
ASSIGN COPYRIGHT TO READ.FM
this is an explicit work-for-hire agreement: you assign your copyright to read.fm, which freely releases your work under the GPLv2..
INCLUDE COPYRIGHT AND LICENSE NOTICE ON ALL CODE:
twext helps you learn to read in any language
free software: http://sf.net/projects/twexter
Copyright © 2008 READ.FM http://license.read.fm
http://more.read.fm/more more read, more market
SHARE SOURCE CODE
you share complete source code and all documentation at http://sf.net/projects/twexter
PAYMENT
funds are already in GAF escrow.. upon satisfactory completion, payday
SERIOUS
please spare us if not serious.. Waqas won the http://twext.cc/dev/twexterBASIC.html job by delivering a rough working demo *before* the bid was awarded.. he showed real skills and interest.. these next steps, done well, can lead to extensive future collab..
|
zura re chunkster
some old code might be helpful to do chunkster? chances are this chunkster for now will
we probably should add a very simple layer between lyric chunkster and translate.google..
re: chunkster and editing chunks, the live preview at http://twext.cc/twexter makes editing nearly impossible.. an earlier twexter had a twexml option and easier chunk editing: http://twext.cc/twexml
both above basic twexter versions (code by Waqas) printing on the wiki soon
here's a list of chunkster chunks in english to play with:
BEFO
i, your, to, under, on, at, of, the, you, your, you're a, so, my, is, too, i'm, i've, you'll, as, she, she's, in, don't, by, are, has
AFTE
,, then, it, with, but, there's, me,
BOTH
and, that, if, now, how, what, when, why, where, for, with, whenever, without, who, in
but we aren't doing that yet |
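Since the main trick is finding double chunk errors, it may help to check the english chunk lists themselves before using them. The sketch below is in Python for illustration (the job asks for PHP5), with hypothetical helper names `duplicates` and `overlaps`. The lists are transcribed by hand from the lists above. One entry is an assumption: the posted BEFO list reads "you're a", which is treated here as two strings, "you're" and "a".

```python
from collections import Counter

# Hand-transcribed from the posted lists.
# Assumption: "you're a" in the BEFO list is two entries with a missing comma.
BEFO = ["i", "your", "to", "under", "on", "at", "of", "the", "you", "your",
        "you're", "a", "so", "my", "is", "too", "i'm", "i've", "you'll",
        "as", "she", "she's", "in", "don't", "by", "are", "has"]
AFTE = [",", "then", "it", "with", "but", "there's", "me"]
BOTH = ["and", "that", "if", "now", "how", "what", "when", "why", "where",
        "for", "with", "whenever", "without", "who", "in"]


def duplicates(strings):
    """Strings listed more than once within a single list."""
    return sorted(s for s, n in Counter(strings).items() if n > 1)


def overlaps(befo, afte, both):
    """Strings that appear in more than one of the three lists."""
    return {
        "BEFO+AFTE": sorted(set(befo) & set(afte)),
        "BEFO+BOTH": sorted(set(befo) & set(both)),
        "AFTE+BOTH": sorted(set(afte) & set(both)),
    }


print("repeated in BEFO:", duplicates(BEFO))
for pair, shared in overlaps(BEFO, AFTE, BOTH).items():
    print(pair, shared)
```

A string that sits in BOTH and also in BEFO or AFTE would get the same return requested twice by a naive chunker, so flagging such entries up front is one cheap guard against doubled returns.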



