Reply to post

Helpful ReplyHot![FAQ]Avoid reading superscript, e.g. citation (reference) numbers on Wikipedia etc.

Author
Admin
Administrator
  • Total Posts : 275
  • Reward points: 0
  • Joined: 2010/11/22 00:00:00
  • Location: USA
  • Status: offline
2016/02/19 07:09:19 (permalink)

Avoid reading superscript, e.g. citation (reference) numbers on Wikipedia etc.

The small superscript text, like reference numbers in scientific articles, Wikipedia articles etc., can be removed from speech output in @Voice with our "Edit speech" feature under menu - Settings. Go there and press the + button on top to add a new speech replacement, then enter exactly the following, make also sure that there are no spaces between and after the entered text:
Replacement type: Regular Expression (RegEx)
Pattern: (<sup>|<sup\b)((?!</sup>).)*</sup>
Replace: 
Leaving the "Replace" field empty will simply silence the references. If you want, you could enter some alternative text there, e.g. "reference provided" or whatever.
 
Here is another replacement, that would avoid reading aloud any text contained within square brackets [like this]:
Pattern: \[.*?\]
 
and a simpler one, that will avoid reading only numbers in square brackets, like [1], [375] etc. :
Pattern: \[\d+\]
Note, the above would not work for Wikipedia or many other Web pages with citations, as inside the square brackets they have some HTML formatting tags, not only numbers.
 
If you are curious, read some of many RegEx references and tutorials on the Web. The particular implementation of Regular Expressions that @Voice uses is described at http://www.regexlab.com/en/deelx/syntax.htm
 
Greg
post edited by Admin - 2019/12/01 04:48:39
Sss
User
  • Total Posts : 0
  • Reward points: 100
  • Joined: 2020/02/12 05:29:17
  • Status: offline
Re: Avoid reading superscript, e.g. citation (reference) numbers on Wikipedia etc. 2020/03/23 22:42:45 (permalink)
Is there a way we can exclude citations starting with numbers in reduced font size?

Following is the sample: 
 
Paul Ree exemplifies a tendency to place present uses of moral concepts — ‘punishment’ for what deters the egoistic, and ‘good’
²⁶ GM II. 12, on which, see also Leiter (2002: 168, 198).
²⁷ On Ree’s account of punishment, see Ch. 5 below.
used as praising the unegoistic—at the origin of morality.
 
This is how the text is being read. After reading 'good' the actual sentence has to be continued from 'used'. I think it is bit difficult to make it leap to 'used' accurately. But if we can just skip the respective sentence which starts with a number in lower font, it will be good. I can send the full PDF if it helps
post edited by Sss - 2020/03/23 22:45:07
Admin
Administrator
  • Total Posts : 275
  • Reward points: 0
  • Joined: 2010/11/22 00:00:00
  • Location: USA
  • Status: offline
Re: Avoid reading superscript, e.g. citation (reference) numbers on Wikipedia etc. 2020/03/24 04:12:23 (permalink) ☄ Helpfulby Sss 2020/03/24 04:18:46
This is very difficult. The best way to exclude such things would be to open the PDF from the "Open" button on top of screen (folder icon), then on the next screen choose the option to "manually crop pages", then as the pages of that PDF are shown on screen in their original format, go through all the pages of article and drag the low edge of the page above the first citation at the bottom of that page. I don't have a way to detect such things automatically at this time, and don't know if I'll ever be able to figure out this.
orineu
User
  • Total Posts : 0
  • Reward points: 0
  • Joined: 2019/10/09 14:37:59
  • Status: offline
Re: Avoid reading superscript, e.g. citation (reference) numbers on Wikipedia etc. 2020/04/02 13:52:44 (permalink)
Sometimes when I encounter this in ebooks, the citations are at the end of the chapter, and I can skip them by manually scrolling down to the bottom and choosing the next chapter option. On some articles, this can be quite a long scroll, however. And nonchaptered ebooks do not have this functionality. Also, OCR of certain books I've read nondigitally would have line-splits as the text wraps around same-page citations... newer made-for-screens versions have no reason to do that, but everyone knows how considerate textbook publishers are of their blind customers.

This has inspired several ideas for implementations that would assist with this and similar issues (read as, "it would be really cool if we could:")
* map keypresses from physical hardware such as a bluetooth keyboard to controls, to set up skipping lines and chapters using wasd perhaps, bluetooth keyboards can be accessibly cheap although ymmv for usability due to bluetooth connection quirks, consider other externals such as android wear and unique patterns of omnibutton/play-pause presses.
* enable optional floating softcontrol menu to include long skip (customizable-length multi-sentence skip separate from current short skip), previous chapter/article, next chapter/article, and 'back'.
* utilize a checkbox that will automatically create something like bookmarks for numbered lines on pdf import, to be combined with
* optionally create a link on the small number found in the text with the small instance of the number following, this may not work with OCR and would absolutely require a 'back' or 'return' feature
post edited by orineu - 2020/04/02 13:54:47
Admin
Administrator
  • Total Posts : 275
  • Reward points: 0
  • Joined: 2010/11/22 00:00:00
  • Location: USA
  • Status: offline
Re: Avoid reading superscript, e.g. citation (reference) numbers on Wikipedia etc. 2020/04/02 14:17:18 (permalink)
Again, for users with good eyesight probably the best option, when dealing with PDF files that contain references or footnotes at the bottom of each page, would be to use "manually crop pages" option that I described in my previously posted message here. I don't see any other good option, don't know how to programmatically detect these. Relying on small numbers starting such footnotes, or the fact that they may be written in a smaller font, is not certain. I have seen footnotes that span several paragraphs and sometimes are continued in footnotes on next pages. Re-configuring buttons to skip more - maybe, but how much? Several phrases/sentences? Entire paragraphs? No idea...
Jump to:
© 2020 APG vNext Commercial Version 5.1