Go to most recent revision | Details | Last modification | View Log | RSS feed
Rev | Author | Line No. | Line |
---|---|---|---|
20 | PointedEar | 1 | \" dvdsubtitles.1 - the *roff document processor src for the dvdsubtitles manual |
2 | \" |
||
3 | \" This file is part of PointedEars' DVD Subtitles. |
||
4 | \" Copyright (C) 2005, 2006 Thomas Lahn <dvd@PointedEars.de> |
||
5 | \" |
||
6 | \" Permission is granted to copy, distribute and/or modify this document |
||
7 | \" under the terms of the GNU Free Documentation License, Version 1.2 |
||
8 | \" or any later version published by the Free Software Foundation; |
||
9 | \" with no Invariant Sections, no Front-Cover Texts, and no Back-Cover |
||
10 | \" Texts. A copy of the license is available on the Web[1] or |
||
11 | \" from the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, |
||
12 | \" Boston, MA 02110-1301 USA. |
||
13 | \" |
||
14 | \" [1] <http://www.gnu.org/licenses/licenses.html#FDL> |
||
15 | \" |
||
16 | \" You may contact the author by: |
||
17 | \" e-mail: dvd@PointedEars.de |
||
18 | \" snail mail: |
||
19 | \" Thomas Lahn |
||
20 | \" Warschauer Strasse 1a/0403 |
||
21 | \" D-99089 Erfurt |
||
22 | \" Federal Republic of Germany |
||
23 | |||
24 | .TH dvdsubtitles 1 "2006-02-15" "0.6.1.2006021519" "DVD Tools" |
||
25 | |||
26 | |||
27 | .SH NAME |
||
28 | dvdsubtitles \- Extract subtitle stream from Video DVD data to text file |
||
29 | |||
30 | |||
31 | .SH SYNOPSIS |
||
32 | |||
33 | .B dvdsubtitles |
||
34 | .br |
||
35 | .RB [ -hVkl ] |
||
36 | .RB [ -t |
||
37 | .IR TITLE ] |
||
38 | .RB [ -s |
||
39 | .IR SUBTITLE ] |
||
40 | .RB [ -o |
||
41 | .IR TARGET ] |
||
42 | .RB [ -g |
||
43 | .IR GREY_LEVELS ] |
||
44 | .RI [ SOURCE ] |
||
45 | |||
46 | .PP |
||
47 | .B dvdsubtitles |
||
48 | .B -c |
||
49 | .RI ( FILE " | " DIRECTORY ) |
||
50 | .RB [ -o |
||
51 | .IR TARGET ] |
||
52 | |||
53 | .PP |
||
54 | .B dvdsubtitles |
||
55 | .RB [ -hVkl ] |
||
56 | .RI [ SOURCE |
||
57 | .RI [ TITLE |
||
58 | .RI [ SUBTITLE |
||
59 | .RI [ TARGET |
||
60 | .RI [ GREY_LEVELS ]]]]] |
||
61 | |||
62 | |||
63 | .SH DESCRIPTION |
||
64 | |||
65 | Uses |
||
66 | .BR tccat (1) |
||
67 | and |
||
68 | .BR tcextract (1) |
||
69 | from the |
||
70 | .B transcode |
||
71 | package to extract a subtitle stream from DVD data. This stream is converted |
||
72 | into image files with |
||
73 | .BR subtitle2pgm , |
||
74 | and those are converted to text files using |
||
75 | .BR pgm2txt , |
||
76 | both from the |
||
77 | .B subtitleripper |
||
78 | package, which in turn calls |
||
79 | .BR gocr (1). |
||
80 | .PP |
||
81 | The previously created subtitle index file (.srtx) and the resulting text |
||
82 | files are then compiled into one text file using either |
||
83 | .BR srttool , |
||
84 | from the |
||
85 | .B subtitleripper |
||
86 | package, or |
||
87 | .BR sed (1). |
||
88 | (See Output Format below.) |
||
89 | .PP |
||
90 | The |
||
91 | .B -c |
||
92 | option allows for compiling existing text files into that format according |
||
93 | a subtitle index file without extracting a subtitle stream first (see below). |
||
94 | \" .PP |
||
95 | \" If the |
||
96 | \" .B -S |
||
97 | \" option is used, this program also performs a minor correction of common |
||
98 | \" misrecognition errors via |
||
99 | \" .BR sed (1), |
||
100 | \" i.e. a standalone `l' (small letter L) or one followed by an apostrophe |
||
101 | \" is converted to `I' (capital letter I). |
||
102 | .PP |
||
103 | All temporary files are deleted afterwards (`clean-up'), unless a fatal error |
||
104 | occurred or a user interrupt was detected; the extracted subtitle stream file |
||
105 | is kept for future use if the |
||
106 | .B -k |
||
107 | option is used. |
||
108 | .PP |
||
109 | Based on |
||
110 | .B DVD ripping and transcoding with Linux |
||
111 | by Moritz Bunkus <moritz@bunkus.org>, available at |
||
112 | .br |
||
113 | <\fIhttp://www.bunkus.org/dvdripping4linux/\fP>. |
||
114 | |||
115 | .SS Output Format |
||
116 | The format of the subtitle text file this program creates is similar to that the |
||
117 | .B SubRip |
||
118 | program creates: |
||
119 | .PP |
||
120 | .I Subtitle number |
||
121 | .br |
||
122 | .IB "Start time " --> " End time" |
||
123 | .br |
||
124 | .I Text of subtitle (one or more lines) |
||
125 | .br |
||
126 | .I Blank line |
||
127 | .PP |
||
128 | Like the SubRip format, field data starts at the beginning of each line. |
||
129 | The time format used is `\fIhours\fB:\fIminutes\fB:\fIseconds\fR', |
||
130 | .I hours |
||
131 | and |
||
132 | .I minutes |
||
133 | with two digits each. The |
||
134 | .I seconds |
||
135 | field is precise to three decimal points; the decimal separator used is |
||
136 | the comma. |
||
137 | .PP |
||
138 | Different to the SubRip format, the line break used is the |
||
139 | standard UNIX Line Feed (LF, 0x0A) character. The |
||
140 | .I Subtitle number |
||
141 | field was not included before version 0.6.1. |
||
142 | |||
143 | |||
144 | .SH OPTIONS |
||
145 | |||
146 | Options are supported since version 0.4. This aims at achieving POSIX |
||
147 | conformance and easy introduction of new features while maintaining |
||
148 | backwards compatibility to versions before 0.4. |
||
149 | .PP |
||
150 | If additional program arguments are used, any option argument is |
||
151 | overwritten by the corresponding program argument. Options may be |
||
152 | given in any order, and are also considered options if located after |
||
153 | the first program argument. Too many program arguments are silently |
||
154 | ignored. |
||
155 | |||
156 | .IP "\fB-c\fP, \fB--compile\fP (\fIFILE\fP | \fIDIRECTORY\fP)" |
||
157 | (since version 0.6.1, \fBTODO\fP) Compile one group, or all groups of |
||
158 | subtitle text files in |
||
159 | .IR DIRECTORY , |
||
160 | according to a corresponding subtitle index |
||
161 | .I FILE |
||
162 | (.srtx), into one text file each. If this is successful, clean up (see |
||
163 | .BR -k ) |
||
164 | and exit. |
||
165 | Options irrelevant to this process are ignored. |
||
166 | |||
167 | .RS |
||
168 | If a |
||
169 | .I DIRECTORY |
||
170 | is specified as argument, the |
||
171 | .B -o |
||
172 | option is ignored. The resulting text files are named after the |
||
173 | corresponding subtitle index files found in the |
||
174 | .I DIRECTORY |
||
175 | instead, and their name is suffixed with `\fB.srt\fP'. |
||
176 | .RE |
||
177 | |||
178 | .IP "\fB-k\fP, \fB--keep\fP" |
||
179 | (since version 0.6) Keep subtitle stream file even if conversion is successful. |
||
180 | |||
181 | .IP "\fB-l\fP, \fB--list\fP" |
||
182 | (since version 0.5) List subtitles for |
||
183 | .I TITLE |
||
184 | using |
||
185 | .BR mplayer (1) |
||
186 | and exit. If |
||
187 | .I TITLE |
||
188 | is not provided or |
||
189 | .RB ` - ', |
||
190 | list subtitles for title #2 (as title #1 may be an intro without subtitles) and |
||
191 | exit. |
||
192 | |||
193 | .IP "\fISOURCE\fP" |
||
194 | Video DVD data source, i.e. a device |
||
195 | .RI "(usually " /dev/dvd ), |
||
196 | a directory e.g. one containing content created via |
||
197 | .BR dvdbackup (1), |
||
198 | or a Video DVD image file. |
||
199 | .br |
||
200 | If |
||
201 | .RB ` - ', |
||
202 | a previously created subtitle stream file named |
||
203 | \fBsubtitle_stream-\fP*\fB-\fP\fITITLE\fP-\fISUBTITLE\fP |
||
204 | in the current working directory will be used for only the |
||
205 | stream-to-graphics-to-text conversion instead. Both |
||
206 | .I TITLE |
||
207 | and |
||
208 | .I SUBTITLE |
||
209 | must not be |
||
210 | .BR ` - ' |
||
211 | in that case. |
||
212 | .br |
||
213 | The default is |
||
214 | .RI ` /dev/dvd '. |
||
215 | |||
216 | .IP "\fB-t\fP, \fB--title\fP \fITITLE\fP" |
||
217 | Number of the title (1-n) which will be accessed for subtitle stream extraction. |
||
218 | If omitted or |
||
219 | .RB ` - ', |
||
220 | the program uses |
||
221 | .BR mplayer (1) |
||
222 | to detect how many titles are available on the DVD data source and asks for the |
||
223 | title to be accessed. |
||
224 | |||
225 | .IP "\fB-s\fP, \fB--subtitle-id\fP \fISUBTITLE\fP" |
||
226 | ID of the subtitle stream to be extracted (0-n). If omitted or |
||
227 | .RB ` - ', |
||
228 | the program uses its |
||
229 | .B -l |
||
230 | option (or |
||
231 | .BR mplayer (1) |
||
232 | directly, before version 0.5) to detect which subtitles are available for the |
||
233 | given TITLE and asks for the ID to be used. |
||
234 | |||
235 | .IP "\fB-o\fP, \fB--output-target\fP \fITARGET\fP" |
||
236 | Name of the resulting subtitles text file. If not provided or |
||
237 | .RB ` - ', |
||
238 | the file is named after the subtitle stream file. |
||
239 | |||
240 | .RS |
||
241 | Versions prior to 0.6 appended the filename suffix |
||
242 | .B .txt |
||
243 | automagically always, and version 0.6 appended this suffix only if this option |
||
244 | was not provided; since version 0.6.1, the filename suffix |
||
245 | .B .srt |
||
246 | (SubRip text) is appended in that case. |
||
247 | |||
248 | .PP |
||
249 | If the file already existed, behavior depends on the program version. |
||
250 | |||
251 | .IP "Since version 0.6.1:" |
||
252 | The program asks whether the file should be overwritten, new content should |
||
253 | be appended to it (including a marker identifying the start of new content), |
||
254 | or if a new file should be created. If the latter, the name of new file |
||
255 | includes the ID of the |
||
256 | .B dvdsubtitles |
||
257 | process which created it. |
||
258 | |||
259 | .IP "Version 0.6:" |
||
260 | The program asks whether the file should be overwritten. If no, a new file |
||
261 | is created, its name including the ID of the |
||
262 | .B dvdsubtitles |
||
263 | process which created it. |
||
264 | |||
265 | .IP "Versions prior to 0.6:" |
||
266 | New content is appended to the file without marker. |
||
267 | .RE |
||
268 | |||
269 | .IP "\fB-g\fP, \fB--grey-levels\fP \fIGREY_LEVELS\fP" |
||
270 | Optional grey-levels value `\fIc0\fP,\fIc1\fP,\fIc2\fP,\fIc3\fP' with 0 <= |
||
271 | .I cN |
||
272 | <= 255, where 0 means black and 255 means white. |
||
273 | |||
274 | .RS |
||
275 | .PP |
||
276 | DVD subtitles allow up to 4 colors. The comma-separated arguments to this |
||
277 | option specify the grey level for each these colors that are used for converting |
||
278 | the subtitle stream graphics into plain text via OCR. The default is |
||
279 | .RB ` 255,255,0,255 '. |
||
280 | The values |
||
281 | .RB ` 255,0,255,255 ', |
||
282 | .RB ` 255,255,255,0 ', |
||
283 | and |
||
284 | .RB ` 0,255,255,255 ' |
||
285 | can also produce viable OCR input. |
||
286 | |||
287 | .PP |
||
288 | Use solid (not outlined) shapes as input when possible to make things easier |
||
289 | for the |
||
290 | .BR gocr (1) |
||
291 | program. It will ask you about a character if it does not recognize it; if |
||
292 | that has an outlined shape, you are advised to cancel the OCR process (with |
||
293 | Ctrl+C). This program will then allow you to use another grey-levels value |
||
294 | without the need to extract the subtitle stream again. |
||
295 | |||
296 | .PP |
||
297 | See the README file of the |
||
298 | .B subtitleripper |
||
299 | package and of the |
||
300 | .B subtitle2pgm |
||
301 | program, which -c option requires this value, for details. (See FILES below.) |
||
302 | .RE |
||
303 | |||
304 | .IP "\fB-h\fP, \fB--help\fP" |
||
305 | Display this help and exit. |
||
306 | |||
307 | .IP "\fB-V\fP, \fB--version\fP" |
||
308 | Display version information and exit. |
||
309 | |||
310 | |||
311 | .SH "EXIT STATUS" |
||
312 | |||
313 | .TP |
||
314 | .B " 0" |
||
315 | Successful program execution |
||
316 | |||
317 | .TP |
||
318 | .B " 1" |
||
319 | Error detecting/extracting subtitle stream, or cancelled without selecting a |
||
320 | title number or subtitle ID |
||
321 | |||
322 | .TP |
||
323 | .B " 2" |
||
324 | Unable to convert subtitle stream to image files |
||
325 | |||
326 | .TP |
||
327 | .B " 3" |
||
328 | Cancelled due to |
||
329 | .BR gocr (1) |
||
330 | error or without entering another grey-levels value |
||
331 | |||
332 | .TP |
||
333 | .B " 4" |
||
334 | Unable to compile to text file |
||
335 | |||
336 | .TP |
||
337 | .B " 5" |
||
338 | Unable to clean up |
||
339 | |||
340 | .TP |
||
341 | .B "127" |
||
342 | Insufficient number of arguments / help was displayed |
||
343 | |||
344 | |||
345 | .SH FILES |
||
346 | .TP |
||
347 | .I /usr/share/doc/subtitleripper/README.gz |
||
348 | Location of the |
||
349 | .B subtitleripper |
||
350 | package README on Debian systems |
||
351 | |||
352 | .TP |
||
353 | .I /usr/share/doc/subtitleripper/README.subtitle2pgm.gz |
||
354 | Location of the |
||
355 | .B subtitle2pgm |
||
356 | program README on Debian systems |
||
357 | |||
358 | |||
359 | .SH NOTES |
||
360 | Tested with GNU/Linux 2.4.31 and 2.6.1.13 to .15 |
||
361 | .B [1] |
||
362 | (Debian |
||
363 | .B [2] |
||
364 | Sarge). |
||
365 | |||
366 | .PP |
||
367 | .B [1] |
||
368 | .RI < http://kernel.org/ > |
||
369 | .br |
||
370 | .B [2] |
||
371 | .RI < http://debian.org/ > |
||
372 | |||
373 | |||
374 | .SH BUGS |
||
375 | Please send comments and bug reports to |
||
376 | .RI < dvd@PointedEars.de >. |
||
377 | |||
378 | |||
379 | .SH "LEGAL NOTICE" |
||
380 | Copyright (c) 2005, 2006 Thomas Lahn <dvd@PointedEars.de> |
||
381 | .PP |
||
382 | This program is free software; you can redistribute it and/or modify it |
||
383 | under the terms of the GNU General Public License as published by the |
||
384 | Free Software Foundation; either version 2 of the License, or (at your |
||
385 | option) any later version. |
||
386 | .PP |
||
387 | This program is distributed in the hope that it will be useful, but |
||
388 | WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY |
||
389 | or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License |
||
390 | (GPL) for more details. |
||
391 | |||
392 | You should have received a copy of the GNU GPL along with this program |
||
393 | .RI ( COPYING " file);" |
||
394 | if not, go to |
||
395 | .B [1] |
||
396 | or write to the Free Software Foundation, Inc., 51 Franklin St, Fifth Floor, |
||
397 | Boston, MA 02110-1301, USA. |
||
398 | .PP |
||
399 | .B [1] |
||
400 | .RI < http://www.gnu.org/licenses/licenses.html#GPL > |
||
401 | .PP |
||
402 | .br |
||
403 | -- |
||
404 | .br |
||
405 | Standard shell script disclaimer blurb thing: |
||
406 | |||
407 | This script is a hack. It's brute force. It's horrible. |
||
408 | It doesn't use Artificial Intelligence. It doesn't use Virtual Reality. |
||
409 | It's not perl. It's not python. It probably won't work unchanged on |
||
410 | the "other" thousands of unices. But it worksforme. --ramiro |
||
411 | .br |
||
412 | (from |
||
413 | .IR /usr/local/mozilla/run-mozilla.sh ) |
||
414 | |||
415 | |||
416 | .SH CREDITS |
||
417 | Thanks to Boris 'bolau' Lau <mail@bolau.de> for translation support, |
||
418 | and to Erkan Yanar <erkan.yanar@t-online.de> for pointing out how to |
||
419 | use |
||
420 | .BR sed (1) |
||
421 | to replace a filename with the file content. |
||
422 | |||
423 | |||
424 | .SH AVAILABILITY |
||
425 | The author's latest version can be obtained from |
||
426 | .br |
||
427 | .RI < http://PointedEars.de/tools/multimedia/dvd/ >. |
||
428 | |||
429 | |||
430 | .SH "SEE ALSO" |
||
431 | |||
432 | .BR dvdbackup (1), |
||
433 | .BR gocr (1), |
||
434 | .BR mplayer (1), |
||
435 | .BR pgm2txt , |
||
436 | .BR sed (1), |
||
437 | .BR srttool " and " subtitle2pgm |
||
438 | (from <\fIhttp://subtitleripper.sourceforge.net/\fP>), |
||
439 | .BR tccat (1), |
||
440 | .BR tcextract (1) |