update tsv backend

devnewton authored on 25/06/2017 18:10:02
Showing 1 changed file
@@ -12,6 +12,18 @@ Each line looks like:
 ${id}\t${time}\t${info}\t${login}\t${message}\n
 ```
 
+Fields MUST NOT contain the following forbidden characters:
+
+- [CHARACTER TABULATION](http://www.fileformat.info/info/unicode/char/0009/index.htm)
+- [CARRIAGE RETURN](http://www.fileformat.info/info/unicode/char/000D/index.htm)
+- [LINE FEED](http://www.fileformat.info/info/unicode/char/000A/index.htm)
+
+[Bouchots](../ontology/bouchot.md) SHOULD replace forbidden characters with
+[SPACE](http://www.fileformat.info/info/unicode/char/0020/index.htm).
+
+[Bouchots](../ontology/bouchot.md) MAY strip non-printable characters or replace them
+with [SPACE](http://www.fileformat.info/info/unicode/char/0020/index.htm).
+
 ## id
 
 Technical post numeric identifier.
@@ -24,14 +36,10 @@ Date and time of post in yyyyMMddHHmmss format.
 
 Free text related to posting [moule](../ontology/moules.md). Usually nickname or browser [User Agent](https://en.wikipedia.org/wiki/User_agent).
 
-This field is stripped from any space character other than [SPACE](http://www.fileformat.info/info/unicode/char/0020/index.htm).
-
 ## login
 
 Optional authenticated  user login.
 
-This field is stripped from any space character other than [SPACE](http://www.fileformat.info/info/unicode/char/0020/index.htm).
-
 ## message
 
-Message body in [BML](./legacy_bml.md) stripped from any space character other than [SPACE](http://www.fileformat.info/info/unicode/char/0020/index.htm).
\ No newline at end of file
+Message body in [BML](./legacy_bml.md).
\ No newline at end of file
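
The forbidden-character rule in this change could be applied roughly as follows. This is a minimal Python sketch, not the project's implementation: the names `sanitize_field`, `format_line`, and `parse_line` are hypothetical, and it assumes a bouchot follows the SHOULD recommendation (replace each forbidden character with SPACE) rather than rejecting the post.

```python
# Hypothetical helpers; the spec only defines the line layout and the
# forbidden characters, not any function names.

# Map each forbidden character (TAB, CR, LF) to a single SPACE.
FORBIDDEN_TO_SPACE = str.maketrans({"\t": " ", "\r": " ", "\n": " "})


def sanitize_field(value: str) -> str:
    """Replace every forbidden character in a field with SPACE."""
    return value.translate(FORBIDDEN_TO_SPACE)


def format_line(post_id: int, time: str, info: str, login: str, message: str) -> str:
    """Build one backend line: id, time, info, login, message, tab-separated."""
    fields = [str(post_id), time, info, login, message]
    return "\t".join(sanitize_field(field) for field in fields) + "\n"


def parse_line(line: str) -> list[str]:
    """Split one backend line back into its five fields."""
    return line.rstrip("\n").split("\t")


line = format_line(42, "20170625181002", "Mozilla\t5.0", "", "hello\r\nworld")
fields = parse_line(line)  # five fields; the empty login is preserved
```

Because sanitized fields can no longer contain a tab or a line break, splitting on `\t` always yields exactly five fields, including an empty `login` for anonymous posts.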