Not signed in (Sign In)

Categories

Vanilla 1.1.1 is a product of Lussumo. More Information: Documentation, Community Support.

    • CommentAuthorBrankGnol
    • CommentTimeNov 20th 2007
     permalink
    Hello,

    I'm working since a few months with Intype without (too much ;) ) problems.
    But I recently got the same problems several times.
    I'm web developper and I often got my file converting to ANSI (from UTF-8) witthout action from me. And i got fifficulties to come back to UTF-8. I have to change of editor to restore my encoding.

    Anybody else got the problem ? Any idea from where it comes from ?

    Thanks in advance, and I hope it could be solved, I have to uninstall Intype now, to prevent from this reccurent problem.

    Tonio

    PS:sorry for this horrible english. ;)
  1.  permalink

    BrankGnol: Your english looks pretty nice, take a look at mine: :)

    Well, there are few things to know about ANSI encoding and the detector (metacrawler plugin). ANSI encoding is a generic encoding for 7-bit encodings. ANSI files therefore contains only character codes under 128 (decimal). When you open a file in Intype, the encoding detector scans first 32kB of the file. If all characters in this first block have lower code than 128, then it returns ANSI. So even if you are saving file using UTF-8 Plain (do not confuse with UTF-8), the detector still fallbacks to the ANSI. We will be working more on this, so it will fallback to UTF-8 Plain if you will set it in the preferences. If you select ANSI and you will use characters with codes greater or equal to 128, Intype will save the file with your current system code-page (we are working on supporting various codepages).

    The difference between UTF-8 and UTF-8 Plain is that the plain version is not storing the BOM bytes to the beginning of the file. So another opening of file in UTF-8 Plain may sometimes result in detecting ANSI. However, using UTF-8 saves the BOM to the beginning of the file and the detector should never return ANSI at next open.

    Another thing is that the conversion of the encoding is not made immediately after selecting different encoding, but is done at save. Intype keeps all files in UTF-16 LE in the memory, and the conversion is made only at the save time.

    Using UTF-8 (not the UTF-8 Plain), should work without problems. If you have problem with UTF-8, it’s a bug. Can you, please, confirm that you are using UTF-8 (not the UTF-8 Plain), so we can declare this as a bug? Or am I completely wrong, and the problem is somewhere else? :)

    • CommentAuthoryita
    • CommentTimeDec 7th 2007 edited
     permalink

    I know that this should not be regarded as a bug, but the use of English in the default font such as “consolas” (because consolas is beautiful), is Unable to display Chinese characters where they become a Block..It’s a big problem for non-English users like me.
    I really expect it to better support other languages, presumably it will not pay a dear price, but it will increase the number of users of other languages.
    Thank you very much to bring us this excellent, simple editor; I hope that more people can use it!

    BrankGnol:
    PS:sorry for this horrible english. ;)

    •  
      CommentAuthordflock
    • CommentTimeDec 7th 2007
     permalink

    This will require the use of fall back fonts for missing glyphs – for example if you’re using Consolas (which is, I agree, beautiful) and trying to display these characters:

    埨仝佲텇漊㌞㌶ – (see Note 1, below)

    you’ll get little ‘missing glyph’ blocks instead, because Consolas doesn’t have glyphs for these characters – not many western fonts do.

    What intype would need to do is to use a different font (or fonts) to display these characters – ones which do have glyphs for them – such as ‘Arial Unicode MS’ – which has pretty much the whole Unicode range (it’s a 25Mb font file!)

    This is what (properly setup) modern web browsers do – no font can be expected to contain glyphs for every single Unicode character, so you need to use the user’s chosen font when you can, but fall back to other fonts for any (usually international) characters which might be missing.

    This might be more complex for an editor like intype though, as it’s not just read-only like a web browser and it would also almost certainly mean sometimes mixing mono-spaced and proportional fonts in the same document, line or even word.

    Note 1: These are just randomly chosen Chinese/Japanese/funny looking Unicode characters – I’ve no idea what they mean. If you’re seeing little boxes, blocks or question marks instead, you’re either using an old web browser, or it’s not setup correctly. Try using ‘Arial Unicode MS’ for the international fonts in your browsers preference settings.

    • CommentAuthornolan
    • CommentTimeDec 7th 2007
     permalink

    Hi,
    While on the topic of encoding.. I have a question that has been on my mind for some years now. Why doesn’t Intype display an asterisk (or a notification that the file has been modified) when the encoding is changed via the menu? Other editors do that too and I always type some random keys then delete them immediately then save to be sure. Is the file saved as soon as encoding is changed ?

    • CommentAuthordaryl
    • CommentTimeDec 7th 2007
     permalink

    No, you’re right, you do need to save to have then encoding change take effect. If I recall correctly Intype uses CR+LF and UTF-16 internally, and converts to the chosen settings on save. Infact, this brings me onto a “bug” I thought might exist the other day, but forgot to check:

    If the file is set to “LF” endings then one would expect searching for “blah\n” might work, but as the file is stored internally as CR+LF you have to search for “blah\r\n”.

  2.  permalink

    dflock: Yeah, font-fallback is not available currently. However, the rendering engine has no problems with non-monospaced fonts, or combinations of various fonts in one documents. I have this feature on my list and I will be working on it during the Christmas.

    nolan: This is actually a bug. The functionality is there, but is not working properly. We will fix this for the next release. So, when you change the line-ending, or encoding, the document gets modified flag.

    daryl: In the release, we are finishing now (the almighty 666) we have solved this problem. This release is internally using LF line-endings (instead of current CR+LF), so all RE matches and searches should work properly. And you noted it right. No matter what line-ending (or encoding) you set in the status-bar, Intype will always see the document in it’s internal representation, which in the new release is UTF-16 LE with LF line endings, and all features depends on this.