Lewis@Gmail
2008-03-11 00:27:25 UTC
I have a file encoded as ISO-8859 (according to the file command at
the command line). it is the ratings.file from imdb's database, and
BBEdit says it's "Western (Mac OS Roman)"
I need the file to be plain ASCII so that I can do grep searches
against it via a php script. Here is some sample data:
0000000123 119567 8.6 LÈon (1994)
0000000124 120390 8.6 Fabuleux destin d'AmÈlie Poulain, Le
(2001)
0000000123 24627 8.5 RashÙmon (1950)
0000000124 69931 8.4 Vita Ë bella, La (1997)
0000000123 12564 8.3 Smultronst‰llet (1957)
0000000114 17411 8.2 8Ω (1963)
I can Zap Gremlins to replace with the code
0000000123 119567 8.6 L\0xC8on (1994)
0000000124 120390 8.6 Fabuleux destin d'Am\0xC8lie Poulain,
Le (2001)
But that doesn't help me in doing a grep search through the file.
I also don't understand why "Smultronstället" shows up as "Smultronst
‰llet" or why 'LÈon' appears instead of 'Léon', etc.
what I want is 'Leon', 'Fabuleux destine d'Amelie Poulain, Le',
'Rashomon', 'Vita e bella, La', and 'Smultronstallet' and '8 1/2'.
And it needs to be fairly quick and easy to fix because I need to
update this file every month or two.
And if anyone knows what I am doing: yes, I did try to compile the
moviedb-3.24 package under Leopard and failed badly.
--
We will fight for Bovine Freedom and hold our large heads high
We will run free with the Buffalo or die
the command line). it is the ratings.file from imdb's database, and
BBEdit says it's "Western (Mac OS Roman)"
I need the file to be plain ASCII so that I can do grep searches
against it via a php script. Here is some sample data:
0000000123 119567 8.6 LÈon (1994)
0000000124 120390 8.6 Fabuleux destin d'AmÈlie Poulain, Le
(2001)
0000000123 24627 8.5 RashÙmon (1950)
0000000124 69931 8.4 Vita Ë bella, La (1997)
0000000123 12564 8.3 Smultronst‰llet (1957)
0000000114 17411 8.2 8Ω (1963)
I can Zap Gremlins to replace with the code
0000000123 119567 8.6 L\0xC8on (1994)
0000000124 120390 8.6 Fabuleux destin d'Am\0xC8lie Poulain,
Le (2001)
But that doesn't help me in doing a grep search through the file.
I also don't understand why "Smultronstället" shows up as "Smultronst
‰llet" or why 'LÈon' appears instead of 'Léon', etc.
what I want is 'Leon', 'Fabuleux destine d'Amelie Poulain, Le',
'Rashomon', 'Vita e bella, La', and 'Smultronstallet' and '8 1/2'.
And it needs to be fairly quick and easy to fix because I need to
update this file every month or two.
And if anyone knows what I am doing: yes, I did try to compile the
moviedb-3.24 package under Leopard and failed badly.
--
We will fight for Bovine Freedom and hold our large heads high
We will run free with the Buffalo or die
--
------------------------------------------------------------------
Have a feature request? Not sure the software's working correctly?
If so, please send mail to <***@barebones.com>, not to the list.
List FAQ: <http://www.barebones.com/support/lists/bbedit_talk.shtml>
List archives: <http://www.listsearch.com/BBEditTalk.lasso>
To unsubscribe, send mail to: <bbedit-talk-***@barebones.com>
------------------------------------------------------------------
Have a feature request? Not sure the software's working correctly?
If so, please send mail to <***@barebones.com>, not to the list.
List FAQ: <http://www.barebones.com/support/lists/bbedit_talk.shtml>
List archives: <http://www.listsearch.com/BBEditTalk.lasso>
To unsubscribe, send mail to: <bbedit-talk-***@barebones.com>