Recent Topics

[Today at 01:46:12 PM]

[Today at 09:27:29 AM]

[Today at 06:04:43 AM]

[Today at 01:20:33 AM]

[May 22, 2019, 05:17:19 PM]

[May 22, 2019, 11:43:06 AM]

[May 22, 2019, 11:22:45 AM]

[May 22, 2019, 02:44:35 AM]

[May 22, 2019, 01:05:34 AM]

[May 21, 2019, 04:58:49 AM]

[May 20, 2019, 05:47:16 PM]

[May 20, 2019, 03:37:42 PM]

[May 19, 2019, 05:50:42 AM]

[May 18, 2019, 11:53:48 PM]

[May 18, 2019, 01:47:14 PM]

[May 18, 2019, 01:45:55 PM]

[May 18, 2019, 01:44:42 PM]

[May 18, 2019, 01:43:15 PM]

[May 18, 2019, 09:09:22 AM]

[May 18, 2019, 06:11:47 AM]

[May 18, 2019, 04:55:52 AM]

[May 18, 2019, 04:30:31 AM]

[May 18, 2019, 03:59:38 AM]

[May 17, 2019, 01:29:38 AM]

[May 14, 2019, 05:36:15 AM]

[May 13, 2019, 07:06:45 PM]

[May 12, 2019, 02:29:06 PM]

[May 05, 2019, 12:40:38 AM]

[May 02, 2019, 03:16:56 PM]

[May 01, 2019, 04:34:46 PM]

[April 30, 2019, 08:30:20 AM]

[April 30, 2019, 08:27:18 AM]

Talkbox

2019 May 20 04:14:26
Cheav Villa:  _/\_ _/\_ _/\_

2019 May 20 01:31:27
Johann:  _/\_ Bhante Indannano

2019 May 19 11:28:39
Khemakumara: Nyom Cheav Villa

2019 May 19 11:27:48
Khemakumara:  _/\_ _/\_ _/\_ Bhante Johann  _/\_ _/\_ _/\_

2019 May 18 23:55:08
Moritz: Vandami Bhante _/\_

2019 May 18 10:34:49
amanaki: Thank you Johann  _/\_

2019 May 18 09:59:33
Johann: Nyom Amanaki. Mudita that you may have possible found what searched for on a special day.

2019 May 18 09:24:56
Maria:  _/\_

2019 May 18 09:24:35
Maria: werter Bhante!

2019 May 18 09:22:43
Johann: Nyom Mizi

2019 May 18 09:21:31
Johann: Nyom Sophorn, Nyom Villa... may all here but also there rejoice in own and others goodness.

2019 May 18 05:03:47
Cheav Villa: សាធុ​សាធុ _/\_ _/\_ _/\_

2019 May 18 02:16:49
Moritz: _/\_ _/\_ _/\_

2019 May 14 07:51:30
Vithou:  _/\_

2019 May 14 05:40:54
Johann: As long as not using telefon while riding. Sokh chomreoun, Nyom.

2019 May 13 18:38:46
Moritz: Vandami Bhante _/\_ (sitting in Taxi)

2019 May 12 15:44:32
Johann: But better ask Nyom Chanroth, since Atma does not walk that far these days.

2019 May 12 15:04:01
Johann: not teally, Nyom Vithou. Still less water in the streams here. Some still dry. Needs a while down from the mountains and not that much rain yet.

2019 May 12 14:54:37
Vithou: how is the road Bhante? Is it float at the mountain leg?

2019 May 12 14:51:59
Vithou:   _/\_

2019 May 12 14:40:43
Johann: Nyom Vithou. Nothing special. Yes, rain is present every afternoon since some days.

2019 May 12 14:38:33
Vithou: Bhante, how is everything at Asrum? Is it raining everyday?

2019 May 12 07:05:30
Cheav Villa:  _/\_ _/\_ _/\_

2019 May 12 03:58:19
Johann: a joyful day in merits on this Sila-day

2019 May 11 17:04:10
Cheav Villa:  :) _/\_

2019 May 11 16:16:56
Moritz: Bong Villa _/\_

2019 May 11 05:35:39
Cheav Villa: Sadhu Sadhu Sadhu  _/\_ _/\_ _/\_

2019 May 11 00:52:44
Johann: an meritful Uposatha, those keeping it today

2019 May 10 17:14:43
Moritz: Chom reap leah, I am going to work. _/\_

2019 May 10 17:09:07
Johann: Nyom Moritz

2019 May 10 17:07:14
Moritz: Vandami Bhante _/\_

2019 May 10 16:19:14
Moritz: Chom reap sour, bong Villa _/\_

2019 May 07 19:12:10
Johann: Nyom Vithou. Just some hours ago, thought of him.

2019 May 05 04:26:53
Chanroth:  _/\_ _/\_ _/\_

2019 May 04 11:41:08
Cheav Villa:  _/\_ _/\_ _/\_

2019 May 04 10:27:38
Khemakumara: Nyom Cheav Villa

2019 May 03 10:08:09
Khemakumara: Sadhu, sadhu, sadhu  _/\_ _/\_ _/\_

2019 May 03 01:17:53
Johann: A meritful new moon Uposatha those celebrating it today.

2019 May 03 01:16:05
Johann: Talk box is buggy and lines love to jump. Better not editing.

2019 May 03 01:14:19
Johann: U Chanroth: "ថ្ងៃនេះខ្ញុំបាទ បានទទួលនៅសម្ភារៈមួយចំនួន សម្រាប់កសាងអាស្រមថ្មទូកសូមជូនបុណ្យដល់ពុទ្ធបរិសទ័ទាំងអស់គ

2019 May 02 15:15:58
Cheav Villa:   <.I.> _/\_

2019 May 02 15:15:17
Cheav Villa: Sorry because of kh font doesn't run well on my phone. Kana go to edit  to see the right  shout but  was wrong by deleting Pou  Chanroth 's  shout

2019 May 02 15:01:04
Cheav Villa: Mudita  :) _/\_

2019 May 02 13:47:17
Moritz: Anumodana puñña kusala! _/\_

2019 May 01 14:49:38
Johann: Now some monks are so close to many, that they can be visited even by feet.

2019 May 01 06:27:25
Johann: Thats accoss the whole city and hot (but cloudy  :) ) Best wishes and greatings.

2019 May 01 06:22:36
Cheav Villa: Get lost in the jungle of Phnom Penh from 2pm till 6pm

2019 May 01 05:40:26
Cheav Villa: Bhante Khemakumara arrived at Wat Sophea Khun in late evening  _/\_ _/\_ _/\_

2019 Apr 29 09:19:56
Johann: Meister Moritz

2019 Apr 29 08:51:27
Moritz: Vandami Bhante _/\_

2019 Apr 29 03:07:40
Moritz: Chom reap leah _/\_ I am going to sleep.

2019 Apr 29 02:59:19
Cheav Villa:  _/\_

2019 Apr 29 02:41:01
Moritz: _/\_ Bong Villa

2019 Apr 29 01:02:45
Johann: let's see wheter the rain has swept away the new floor ...

2019 Apr 28 13:58:19
Cheav Villa: First time of Heavy rain in Phnom Penh  _/\_ _/\_ _/\_

2019 Apr 28 10:33:14
Cheav Villa: សាធុ​សាធុ  _/\_ _/\_ _/\_

2019 Apr 28 10:21:50
Johann: Oh, rain.

2019 Apr 28 08:46:02
Johann: Pleasing soft weather, yes  :)

2019 Apr 28 08:45:28
Johann: Atma rejoices with the may who possible are able to see a death man walking toward the deathless, and have even occation for a lot of spontan merits.

2019 Apr 28 07:31:59
Cheav Villa: May​ Bhante be well  _/\_ _/\_ _/\_  The sky has been cloudy since 10:30am

2019 Apr 28 03:02:15
Ieng Puthy: 🙏🏻🙏🏻🙏🏻 May Bhante Khemakumara walk(nimun) safely. _/\_ _/\_ _/\_

2019 Apr 27 20:51:41
Johann:  :)

2019 Apr 27 20:51:11
Johann: may it be cloudy (in the hot time) so that feet may be well

2019 Apr 27 17:53:19
Moritz: May Bhante travel safely. _/\_ _/\_ _/\_

2019 Apr 27 16:38:40
Cheav Villa: May​ the Mighty Devas protected Bhante Khemakumara along the path to Wat Sophea Khun  _/\_ _/\_ _/\_

2019 Apr 27 16:03:29
Cheav Villa: ខ្ញុំ​កូណាបាន លឺថា ព្រះអង្គ​ Kemakumara នឹង​និមន្ត ចេញពី វត្ត​អកយំ​ នៅថ្ងៃស្អែក

2019 Apr 27 16:01:32
Cheav Villa: ថ្វាយបង្គំ​ព្រះអង្គ  _/\_ _/\_ _/\_

2019 Apr 27 14:28:42
Ieng Puthy: 🙏🏻🙏🏻🙏🏻

2019 Apr 27 06:55:13
Johann: Nyom Villa

2019 Apr 27 06:54:35
Cheav Villa:   _/\_ _/\_ _/\_

2019 Apr 27 06:31:42
Johann: Nyom Moritz

2019 Apr 27 06:09:34
Moritz: Vandami Bhante _/\_

2019 Apr 27 05:42:54
Moritz: _/\_

2019 Apr 27 00:54:04
Johann: A blessed and meritful halfmoon Sila-day

2019 Apr 25 07:32:44
Ieng Puthy: 🙏🏻🙏🏻🙏🏻អរព្រះគុណ ព្រះអង្គ

2019 Apr 25 04:42:51
Johann: Sokh chomreoun, Nyom. (May well-being come to fullfillment.)

2019 Apr 25 02:30:46
Ieng Puthy: តេីលោកRoman មានបំណងទៅវត្តអកយំនៅថ្ងៃណាដែរ?ព្រះអង្គ🙏🏻🙏🏻🙏🏻

2019 Apr 25 02:29:26
Ieng Puthy: ករុណានិង បងសុភឿន នឹងជូនលោកRoman ទៅវត្តអកយំបាន

2019 Apr 25 02:28:00
Ieng Puthy: ករុណានិង បង សុភឿន នឹងជួយស

2019 Apr 25 02:27:00
Ieng Puthy: 🙏🏻🙏🏻🙏🏻ករុណាថ្វាយបង្គំុព្រះអង្គ Vandami Bhante

2019 Apr 24 17:56:05
Cheav Villa: កូណា សរសេរពួកយើង​ គឺជំនួសមុខ​ បងពុទ្ធីនិងសុភឿន  _/\_

2019 Apr 24 17:54:42
Cheav Villa: បង​ពុទ្ធី បានអោយកូណាសួរអំពីពេលវេលា​ ដែលលោកRoman នឹងទៅអកយំ _/\_

2019 Apr 24 17:52:47
Cheav Villa:  _/\_ _/\_ _/\_  កូណាបាន ប្រាប់បងពុទ្ធី និង​សុ​ភឿន ប្រសិនបើគាត់អាចជួយបាន ព្រោះកូណាមិនមានសេរីភាពច្រើនដូចពួកគាត់

2019 Apr 24 17:01:34
Johann: Modern (ab)art of conversation and old patient culture...  :) great training only serious take on and rushing hide on messanger, fb, or in the ocean of Maras internet. Mudita.  :)

2019 Apr 23 13:36:18
Cheav Villa: Kana :D _/\_

2019 Apr 23 13:24:57
Johann: ? But light is always good. Oh, maybe the honey bee candles...: Atma told Upasika Sophorn to take them with her to share, since the mices would eat them away here. Mudita

2019 Apr 23 12:52:51
Cheav Villa: Kana Preah Ang  _/\_ Vithou told kana that Bhante sending us a pair of candles all through Bang Sophorn  :D _/\_

2019 Apr 23 12:06:15
Johann: Nyom Villa. Atma does not understand all circumstances but much mudita and appreciantion with sharing merits with each other, taking each other along good.

2019 Apr 23 11:04:23
Cheav Villa: កូណា ទើបបានដំណឹងពី Vithou ថាព្រះអង្គផ្ញើទានមួយគូមក តាមរយៈ​បងសុភ័ណ​ ខ្ញុំកូណា​សូម​អរព្រះគុណ​  :) _/\_

2019 Apr 23 11:02:19
Cheav Villa: ថ្វាយបង្គំ​ព្រះអង្គ  _/\_ _/\_ _/\_

2019 Apr 23 02:03:31
Johann: Nyom. (smilies of the common places are not visible here for many)

2019 Apr 22 17:36:16
Ieng Puthy: 🙏🏻🙏🏻🙏🏻ករុណាសូមថ្វាយបង្គំុ Vandami Bhante

2019 Apr 22 15:54:07
Cheav Villa: Master Moritz  _/\_

2019 Apr 22 15:03:17
Moritz: _/\_ Bong Villa

2019 Apr 20 07:30:33
Moritz: Vandami Bhante _/\_

2019 Apr 20 05:25:34
Moritz: _/\_ bong Vithou

2019 Apr 19 06:30:18
Cheav Villa:  _/\_

2019 Apr 19 06:25:58
Moritz: _/\_ bong Villa

2019 Apr 19 06:25:48
Moritz: _/\_ _/\_ _/\_ Bhante

Tipitaka Khmer

 Please feel welcome to join the transcription project of the Tipitaka translation in khmer, and share one of your favorite Sutta or more. Simply click here or visit the Forum: 

Search ATI on ZzE

Zugang zur Einsicht - Schriften aus der Theravada Tradition



Access to Insight / Zugang zur Einsicht: Dhamma-Suche auf mehr als 4000 Webseiten (deutsch / english) - ohne zu googeln, andere Ressourcen zu nehmen, weltliche Verpflichtungen einzugehen. Sie sind für den Zugang zur Einsicht herzlich eingeladen diese Möglichkeit zu nutzen. (Info)

Random Sutta
Random Article
Random Jataka

Zufälliges Sutta
Zufälliger Artikel
Zufälliges Jataka


Arbeits/Work Forum ZzE

"Dhammatalks.org":
[logo dhammatalks.org]
Random Talk
[pic 30]

Zugang zur Einsicht - Übersetzung, Kritik und Anmerkungen

Herzlich Willkommen im Arbeitsforum von zugangzureinsicht.org im Onlinekloster sangham.net!


Danke werte(r) Besucher(in), dass Sie von dieser Möglichkeit Gebrauch machen und sich direkt einbringen wollen.

Unten (wenn Sie etwas scrollen) finden Sie eine Eingabemaske, in der Sie Ihre Eingabe einbringen können. Es stehen Ihnen auch verschiedene Gestaltungsmöglichkeiten zur Verfügung. Wenn Sie einen Text im formatierten Format abspeichern wollen, klicken Sie bitte das kleine Kästchen mit dem Pfeil.

Die Textfelder "Name" und "email" müssen ausgefüllt werden, Sie können hier aber auch eine Anonyme Angabe machen und eine Pseudo-email angeben (geben Sie, wenn Sie Rückantwort haben wollen, jedoch einen Kontakt an), wenn Ihnen das unangenehm ist. Der Name scheint im Forum als Text auf und die Email ist von niemanden außer dem Administrator einsehbar.

Wenn Sie den Text fertig geschrieben haben, müssen Sie noch den Spamschutz überwinden, das Bild zusammen setzen, und dann auf "Vorschau" oder "Senden" drücken, wenn für Sie alles passt.

Wenn Sie eine Spende einer Übersetzung machen wollen, wäre es schön, wenn Sie etwas vom Entstehen bzw. deren Herkunft erzählen und Ihrer Gabe vielleicht noch eine Widmung anhängen.

Gerne, so es möglich ist, werden wir Ihre Übersetzung dann auch den Seiten von Zugang zur Einsicht veröffentlichen. Für generelle Fragen zu dem Umfang der Dhamma-Geschenke auf ZzE sehen Sie bitte in den FAQ von ZzE ein.

Gerne empfangen wir Kritik und selbstverständlich auch Korrekturen oder Anregungen hier. Es steht Ihnen natürlich offen und Sie sind dazu herzlich eingeladen auch direkt mit einem eigenen Zugang hier an den Arbeiten vielleicht direkt teilzunehmen.

Sadhu!

metta & mudita
Ihr Zugang zur Einsicht Team

Um sich im Abeitsforum etwas unzusehen, klicken Sie hier. . Sie finden hier viele Informationen und vielleicht sogar neues rund um Zugang zur Einsicht.

Author Topic: [ATI.eu] CSCD xml to ati.eu format: converting, editing  (Read 9156 times)

0 Members and 1 Guest are viewing this topic.

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #75 on: March 11, 2019, 07:23:36 AM »
Having already found one of an potential content eater, having forgotten to escape dot

<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[. ]*<hi rend=[^\w]dot[^\w]>[. ]*<\/hi>[. ]*([^\n]*)<\/p>[\s]*   <div hangnum><span para #para_$1>[$2]</span></div> $3\n\n
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #76 on: March 12, 2019, 01:23:18 PM »
Status (lokal)

Some files have been re-renamed. Current list: Renaming of source files , renaming files.

Regex-list for xml- to ati-standard as done for "cs-rm", "cs-km", "cs-th", "cs-ru" at once.

Note that {...} strings will be replaced in a later selective session. Replacments are done "single-line" if not other mentioned.

##Starting with the header and footer, which replaces "content".

###HEADER multiline (10792 replacements)

Code: [Select]
[\s]*<\?xml(.+?)<body>[\s]*

<span hide>sources: cs-file name {cs file} path ati{lang}:{ns-section}:{file}</span>\n{{section>en:tech:template_includes#{lang}_header&nouser&nodate&noheader&noeditbutton&firstsectiononly}}\n<div {lang}>\n\n

###FOOTER multiline

Code: [Select]
[\s]*<\/body>(.*)<\/([^p]*?)>[\s]*

\n\n</div>\n{{section>en:tech:template_includes#{lang}_footer&nouser&nodate&noheader&noeditbutton&firstsectiononly}}

###CS-CD ANCHORS

Code: [Select]
<pb ed=[^\w]([^<>]*?)[^\w] n=[^\w]([^<>]*?)[^\w][\s]*\/>

<span anchor #$1_$2></span>

###BOLD

Code: [Select]
<hi rend=[^\w]bold[^\w]>([^\n]+?)<\/hi>

**$1**

###P CENTRE

Code: [Select]
<p rend=[^\w]centre[^\w]>(.*?)<\/p>

<div centeralign>$1</div>

###NOTE

Code: [Select]
<note>([^\n]+?)<\/note>

<span note>$1</span>

###P HI PARANUM DOT

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]dot[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###P HI PARANUM DOT []

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)[\. ]*?<\/hi>[\. ]*?([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P

Code: [Select]
<p rend=[^\w]bodytext[^\w]>([^\n]+?)<\/p>[\s]*

$1\n\n

###P HI PARANUM DOT

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*<hi rend=[^\w]dot[^\w]>[\. ]*<\/hi>[\. ]*([^\n]*)<\/p>[\s]*

<div hangnum><span para #para_$1>[$2]</span></div> $3\n\n

###GATHA

Code: [Select]
<p rend=[^\w]gatha([^<>]*?)[^\w]>([^\n]+)<\/p>[\s]*

<div gatha$1>$2</div>\n\n

###INDENT|UNINDENTED

Code: [Select]
<p rend=[^\w](indent|unindented)[^\w]>([^\n]+)<\/p>[\s]*

<div $1>$2</div>\n\n

###NIKAYA

Code: [Select]
<p rend=[^\w]nikaya[^\w]>([^<>]*?)<\/p>

<div centeralign #nikaya>**$1**</div>\n<span sang_id #{file--}>[[{path-release}:{file--}|{file--}]] | [[{path-source}:{file}#{file--}|source]]</span>

###BOOK 868

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<\/p>

======== $1 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>

###CHAPTER

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<\/p>

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<\/p>

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<\/p>

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###SUBSUBHEAD

Code: [Select]
<p rend=[^\w]subsubhead[^\w]>([^<>]*?)<\/p>

=== $1 ===\n<span sang_id #{file-}.{no+}>[[{path-release}:{file-}.{no+}|{file-}.{no+}]] | [[{path-source}:{file}#{file-}.{no+}|source]]</span>

###SUBHEAD NOTE

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###CHAPTER NOTE

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE NOTE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD ANCHOR

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n<span span anchor #$2</span>

###CHAPTER ANCHOR

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>\n<span span anchor #$2</span>

###TITLE ANCHOR

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>\n<span span anchor #$2</span>

###BOOK ANCHOR

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======== $1$3 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>\n<span span anchor #$2</span>

This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #77 on: March 15, 2019, 10:39:40 AM »
Further edits:

###DOT

Code: [Select]
<hi rend="dot">\.</hi>

.

###HANGUM INTO HEADER (JAT only) multiline

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== $4. $5 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGUM INTO HEADER (JAT only) multiline [..] X.

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ \[([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\] ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== [$4] $5. $6 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGUM INTO HEADER HH (JAT only) multiline

Code: [Select]
======= ([^<>]*?) =======[\r\n]+<span sang_id #\{file\}>\[\[\{path-release\}:\{file\}\|\{file\}\]\] \| \[\[\{path-source\}:\{file\}#\{file\}\|source\]\]<\/span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)[\r\n]+

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>$2==== $3. $4 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGUM INTO HEADER HH (JAT only) no NO.

Code: [Select]
<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([^\n]*?)[\r\n]+

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGNUM CORR (exception in bud-vgs.nk.2_any.txt and sut.sn.01.txt!!)

Code: [Select]
<p rend=[^\w]hangnum[^\w]>([^<>]+?)\.<\/p>

<div hangnum>$1.</div>

###Search "<p rend=[^\w]hangnum[^\w]>" further 47 hits in 39 files: best made one by one since many exceptions.

###BOLD corrections

without regex:

Code: [Select]
]</span> .

]</span>

correction before, ###HANGNUM, again

###P HANGNUM HI PARANUM BOLD

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]bold[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###further <hi rend="bold"> corr. are made on the single pages

###P HANGNUM HI PARANUM

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<\/p>[\s]*

<span para #para_$1>[$2]</span>\n\n

###P INTENT PARANUM

Code: [Select]
<p rend=[^\w]indent[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>\. ([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P HANGNUM HI PARANUM content

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###GATHA PARANUM

Code: [Select]
<div gatha1[^\w] n=[^\w]([0-9]*?)><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*([^\n]*?)</div>

<span para #para_$1>[$2]</span>\n\n<div gatha1>$3</div>

###Correction

Code: [Select]
<div gatha2" n="-><hi rend="paranum">-</hi>

<div gatha2>

###Manual corrections for all matches of "<p rend"

###Cleanings

Code: [Select]
\r\n

\n

There might be further xml-tags left and small edits needed, but those can be made online.

Atma will now replace the placeholder (except {no}, {no+}) where he has no idea of how to process that right and effective for now, and then upload all files anew.

(Note: working/processing on replacements with batchedit online is much faster as with notepad++ local (about a 3-4 days). Of course the cleaning of cache and delete of history online takes the also a good while.)
« Last Edit: March 15, 2019, 10:44:41 AM by Johann »
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #78 on: March 15, 2019, 10:58:27 AM »
And using Powershell such as ((Get-Content vin.par.ve.txt -Raw) -replace '{lang}','cs-km') | Set-Content vin.par.ve.txt destroyed the files, possible a utf-8-issue... (and having not made a backup...)

all once again  ^-^ :)
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #79 on: March 16, 2019, 01:02:54 AM »
Atma will upload the renamed files with original content and try again to make the replacements online with batchedit, since having come across that Notepad sometimes loses found matches and gives nothing back when replacing.
In this way, at least, the originals would be stored on ati as well. Lets see whether web-space and sun allows it the next days.
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #80 on: March 16, 2019, 04:25:30 PM »
Files are all anew uploaded so far. The Khmer files need some rest replacements of xml codes. Renamed files have been deleted.

Once the index is rebuild, the last replacements can be made.

As for the replacements of the placeholder {file}, {ns-section}... it's maybe good if runing similar scripts on the server.

In regard of {no}: no over all idea for now, so maybe good as before.

Attached an excel-list containing all particular replacements for each single file.
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #81 on: March 17, 2019, 11:41:59 AM »
List of renaming of the index files (toc.xml): renaming_files#index-files_toc
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #82 on: March 23, 2019, 11:45:38 AM »
Main indexes in the four scripts should be fine and complete now:

Tipiṭaka (Roman)
តិបិដក (បាឡិ​ខ្មែរ) ติปิฎก (Thai) д̇ибидага (кириллица)
My person currently ties to rebuild the index by actualization option, which actually seems to be double slower as to build anew, but possible would not aim in no index when stopping in between (about 3000 pages of 20500 indexed since this morning)
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
from: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #83 on: April 01, 2019, 12:23:19 PM »

Aramika   *

Ein oder mehrer Beiträge wurden hier im Thema abgeschnitten und damit in neues Thema "[ati.eu] Indexing, search engine " eröffnet, dem angehäng.
« Last Edit: April 01, 2019, 12:29:19 PM by Johann »
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #84 on: April 02, 2019, 08:18:28 AM »
{lang} and {ns-section} have now replaced on all pages except the 416 pages in cs-th (Thai, 268 in Atthakatha and 148 pages in Tika)

The further replacements ({file}, {path-source}...) could be made according the list above either page for page or with a script using the list. Files+/- etc, how ever, may need further renderings later. {no}... the same.

Sadhu for the great work and assitence of many to bring the first four languages into here and the availability for the Sangha and those with Nissaya.

Atma will look after the last xml converting into ati-syntax in the Khmer pages and then look after the css for "good" layouts.

An Excel-file which is of help for creation of the release files, also in languages to come, can be used: renaming_list.xlsx To extract them into directories and files for an upload the Converting lists into txt-files - Tools for Ati.eu can be used.
« Last Edit: April 07, 2019, 01:24:01 PM by Johann »
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +361/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #85 on: April 11, 2019, 12:38:23 PM »
Currently working on the "single-sutta release" files, which can require some time, given about 40.000 headers, but would then also give finally values for the {no..} replacements (for links to them) in the source-files.

Since making single files for Atthakatha and Tika would cause huge amount of files, if not skipping, and so Atma thought of implementing the related commentaries direct in the Sutta (Mula) files.
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Tags: