Recent Topics

[Today at 03:55:47 PM]

[Today at 02:25:39 PM]

[Today at 02:05:50 PM]

[Today at 12:18:41 PM]

[Today at 07:56:58 AM]

[Today at 06:00:12 AM]

[Today at 04:08:33 AM]

[Today at 12:37:40 AM]

[March 24, 2019, 12:26:55 PM]

[March 24, 2019, 08:29:57 AM]

[March 20, 2019, 06:59:48 PM]

[March 20, 2019, 01:12:07 AM]

[March 18, 2019, 11:10:31 AM]

[March 17, 2019, 06:09:06 AM]

[March 17, 2019, 12:14:09 AM]

[March 16, 2019, 04:47:08 AM]

[March 15, 2019, 04:34:31 PM]

[March 15, 2019, 04:32:21 PM]

[March 15, 2019, 04:29:51 PM]

[March 15, 2019, 04:28:17 PM]

[March 15, 2019, 04:26:30 PM]

[March 15, 2019, 04:20:06 PM]

[March 15, 2019, 04:18:22 PM]

[March 15, 2019, 04:17:08 PM]

[March 15, 2019, 03:57:05 AM]

[March 15, 2019, 03:56:11 AM]

[March 15, 2019, 03:55:08 AM]

[March 15, 2019, 03:54:13 AM]

[March 15, 2019, 03:52:55 AM]

[March 14, 2019, 03:22:34 PM]

[March 14, 2019, 03:10:03 PM]

[March 14, 2019, 03:06:17 PM]

[March 14, 2019, 10:18:44 AM]

[March 14, 2019, 08:33:43 AM]

[March 14, 2019, 08:16:10 AM]

Talkbox

2019 Mar 25 15:28:24
Johann: Others, even if pure, is open for critic and bad assumings. Envy, jealously... on each side. So to become not susbected, needs good advices, double hard for woman to act perfect.

2019 Mar 25 15:23:21
Johann: Pappies can give good lessons when they don't fall, but the deeper, the hard training, must come from liberal mommies for daughters and liberal pappies for sons, in the world and for beyond.

2019 Mar 25 15:11:15
Cheav Villa:   _/\_ _/\_ _/\_

2019 Mar 25 15:04:40
Johann: Does mommy generally don't actually love her child when not react?

2019 Mar 25 15:03:13
Johann: Mommy, mommy... but I like... ohh, they bite me... I am hungry... when do we go back?... why must we sit here... mommy!

2019 Mar 25 15:00:37
Johann: Which can be changed all the time, with feeding rightly.

2019 Mar 25 14:55:50
Cheav Villa: កូណាសង្ឃឹមថា.​ ជារឿងឧបនិស្ស័យ.​  :D _/\_ _/\_ _/\_

2019 Mar 25 14:50:25
Johann: ...teaching and mirror.

2019 Mar 25 14:49:40
Johann: When one is stressed or busy, remember how many times one him/herself was so as well. Intention will always has it's effects, sometimes quick, sometimes even sisters, mother and child... Nyom wouldn't have become here, if there would be not long, long bounds. Watch your child. Thats always a great t

2019 Mar 25 14:40:41
Cheav Villa: _/\_ _/\_ _/\_

2019 Mar 25 14:39:48
Cheav Villa: ព្រោះកូណាមិនអាចទៅឱរ៉ាល់បានកំឡុងពេលនេះ តែអាចជួបនៅភ្នំពេញបាន តែចេតនាម្ខាងមិនចង់ជួប ទើបមិនអាចជួប

2019 Mar 25 14:38:39
Cheav Villa: កូណាគ្រាន់តែចង់បញ្ជាក់អំពីចេតនា ដែលម្នាក់មាន និងម្នាក់គ្មាន ក្នុងការ​ជួបគ្នា.ជាហេតុមិនបានជួប :D _/\_

2019 Mar 25 14:37:02
Cheav Villa:    _/\_ _/\_ _/\_

2019 Mar 25 13:44:16
Johann: Nyom Vithou. Foot well again?

2019 Mar 25 11:56:36
Johann: Good stories: The Healing of the Bull and Prisoners of Karma .

2019 Mar 25 11:51:20
Johann: Although monks are allowed to explain how to work oneself out, they are not given to liberate "slaves", prisoner, as this would count as thief.

2019 Mar 25 11:46:29
Johann: When one is "bond" in families, relations... there is so much soil and danger for envy and jealously, and itjs not easy to "pay" one out. Normal being live from making others depending on one and fear to lose a "cattle" they gave so much into.

2019 Mar 25 09:53:57
Khemakumara: Nyom Cheav Villa

2019 Mar 25 05:00:43
Johann: Some pull back and offer favors, less are giving ways out. Let them wishing to live in cities go back, maintaining homes. Bond by mara there is no escape, they leave alm bowls behind and step into cars, to "help".

2019 Mar 25 04:11:30
Johann: One (who ever) gives the best when helping, assist, in sharing Dhamma, translate it, sort it, easy accessable and maintain it, by proper and given means.

2019 Mar 25 04:03:32
Cheav Villa:  _/\_ _/\_  _/\_

2019 Mar 25 03:53:23
Johann: One stopped at the topic Sangahak, worldily and best. It's hard to try to translate but bears a lot of fruits. Conceit is dangerous, and a feeling of "right" turns quick into lose. Satipatthana, the right workingplaces. And again Ways of  

2019 Mar 25 02:53:11
Kong Sokdina: ដំណើរផ្លូវឆ្ងាយ មិនប្រមាណ បាននូវសេចក្ដីប្រមាថ។

2019 Mar 24 18:54:30
Cheav Villa: ជាចិត្តលំអៀងព្រោះជំពាក់ក្នុងសេចក្តីស្រលាញ់សាច់ញាតិ​ បងប្អូន.​ ឪពុកម្ដាយ​  ^-^

2019 Mar 24 18:49:47
Cheav Villa: ជារឿងដដែល យូរណាស់មកហើយ  ខ្ញុំ​កូណាគួរតែបានរកឃើញផ្លូវកណ្តាល​  :) _/\_

2019 Mar 24 18:48:21
Cheav Villa: ការជាប់ជំពាក់នៅក្នុងការជួយអ្នកដទៃ ការជាប់ជំពាក់នៅក្នុងការចង់អោយគេបានសុខ តែងនាំសេចក្តីទុក្ខ ដល់ខ្លួន

2019 Mar 24 13:23:44
Johann: Nyom Senghour

2019 Mar 23 06:34:02
Johann: No problem Nyom Vithou.

2019 Mar 23 04:32:25
Vithou: ngyom kuna will inform Preah Ang later when kuna get better

2019 Mar 23 04:31:12
Vithou: Preah Ang, ngyom kuna cannot go Phnom Aural tomorrow due to my foot problem. I let is a bit pain and Ngyom kuna cannot walk properly

2019 Mar 20 19:02:36
Johann: May Nyom have a safe and careful travel, when ever he might do.

2019 Mar 20 18:46:53
Johann: Now, Nyom Moritz and Nyom Sophorn are here in the forest. Atma does not know how long.

2019 Mar 20 18:20:42
Vithou: Nyom Kuna planning to see Moritz and Bong Sophorn as well

2019 Mar 20 18:17:26
Vithou: Nyom Kuna planning to go Phnom Oral on Sunday

2019 Mar 20 18:16:07
Vithou: Nyom Kuna was sick too long after remove kidney stone . ( about 2 months ) and skittle busy with human job

2019 Mar 20 18:13:50
Vithou: Is Moritz and Bong Sophorn at Asram now?

2019 Mar 20 18:12:02
Vithou: Kuna Preah Ang

2019 Mar 20 18:05:01
Johann: Nyom Vithou

2019 Mar 20 12:25:25
Johann: here and now

2019 Mar 20 09:04:34
Cheav Villa: To reach the end of the cosmos to go beyond   _/\_

2019 Mar 20 08:49:03
Johann: Oh.. wrong, here: Rohitassa Sutta

2019 Mar 20 08:45:24
Johann: There is nothing to far that the mind, oneself, could not reach it in the Universe. Yet end of suffering can not found. But it requires to reach the end of the cosmos to go beyound. Lokayatika Sutta

2019 Mar 20 08:14:55
Chanroth:  _/\_ _/\_ _/\_

2019 Mar 20 06:34:16
Ieng Puthy: 🙏🏻🙏🏻🙏🏻ករុណាសូមថ្វាយបង្គំុ Vandami Bhante ពីចំងាយ

2019 Mar 20 06:32:27
Ieng Puthy: 🙏🏻🙏🏻🙏🏻 សាធុុ សាធុ សាធុ! ថ្ងៃឧបោសថ ករុណាសូមចូលរួមត្រេកអរដែរ😇🙏🏻🙏🏻🙏🏻

2019 Mar 20 06:13:30
Johann: Mudita

2019 Mar 20 05:52:37
Cheav Villa: ថ្ងៃឧបោសថ​  :)​ កូណាចូលរួមត្រេកអរដល់សេរីភាពរបស់ពួកគាត់  _/\_

2019 Mar 19 13:56:41
Cheav Villa:  _/\_ _/\_  _/\_

2019 Mar 19 08:47:34
Johann: Knowing this, one is able to share family (of supporter), honor, dwelling, gains, dhamma, having a clear sense of gratitude.

2019 Mar 19 08:44:19
Johann: And what is the reason of macchariya disapearing? Knowning, remembering, form, sound...ideas are no refuge, not real, not worthy to fall for, not ones own.

2019 Mar 19 08:41:50
Johann: "May macchariya (stinginess) become its needed reduce, so that the goodness of Dhamma may reaches more intensive", is that right translated? Sadhu. Not an easy task althought one might think.

2019 Mar 19 08:07:30
Cheav Villa: សូមអោយមច្ឆរិយ.​ ត្រូវបានកាត់បន្ថយ ដើម្បីគុណធម៌ដែលខ្ពស់ជាង _/\_ _/\_ _/\_

2019 Mar 19 01:39:39
Johann: May all have always a good travel, meet each other in best furtune along there ways. May no one get hurt in the traffic after happiness.

2019 Mar 18 10:21:51
Moritz: hello empty yogi _/\_

2019 Mar 18 10:21:23
Moritz: Vandami bhante _/\_

2019 Mar 18 10:21:22
Moritz: Vandami bhante _/\_

2019 Mar 16 08:19:42
Johann:  _/\_ Bhante Indannano

2019 Mar 15 04:20:37
Johann: Nyom Buddhi

2019 Mar 14 16:00:28
Cheav Villa: កូណា​ថ្វាយបង្គំ​លា​ សូមព្រះអង្គឆាប់បានសំរាក​  _/\_ _/\_ _/\_

2019 Mar 14 09:13:11
Kong Sokdina: Hello!

2019 Mar 14 08:35:17
Johann: Nyom Kong Sokdina

2019 Mar 14 08:10:07
Johann: A meritful Sila-day those who observe it today

2019 Mar 13 11:18:19
Cheav Villa:  _/\_ _/\_ _/\_

2019 Mar 13 09:23:36
Johann: A meritful Sila-day those who observe it today, btw.

2019 Mar 10 02:03:42
Cheav Villa:  _/\_

2019 Mar 09 07:45:59
Moritz: Good morning, Cheav Villa _/\_

2019 Mar 07 12:10:15
Cheav Villa: កូណាព្រះអង្គ  _/\_

2019 Mar 07 12:05:44
Johann: My person is not sure but Nyom Chanroths wife might try to go back from PP tomorrow as well, Nyom.

2019 Mar 07 01:21:24
Cheav Villa:  _/\_ _/\_ _/\_

2019 Mar 07 01:18:59
Johann: May you have a safe and slow travel.

2019 Mar 07 01:10:20
Cheav Villa: ចេញពី​ភ្នំពេញ​នៅម៉ោង5ព្រឹក :)  _/\_

2019 Mar 07 01:09:25
Cheav Villa: ពួកខ្ញុំកូណា និង បង​ភឿន​ នឹងទៅសួរសុខទុក្ខព្រះអង្គនៅថ្ងៃស្អែក

2019 Mar 07 01:08:34
Johann: Nyom Villa

2019 Mar 07 01:07:32
Cheav Villa: ថ្វាយបង្គំ​ព្រះអង្គ  _/\_ _/\_ _/\_

2019 Mar 06 03:31:04
Cheav Villa: Have a good day to Master Moritz  _/\_

2019 Mar 06 03:29:38
Moritz: And good night for me. :) Chom reap leah. May all have a good Uposatha _/\_

2019 Mar 06 03:27:51
Moritz: Good morning, Cheav Villa _/\_

2019 Mar 06 03:17:49
Moritz: Vandami Bhante _/\_

2019 Mar 06 03:05:26
Johann: A self-blessed (by ones deeds) Uposathaday today.

2019 Mar 05 11:33:43
Johann: Nyom Chanroth had to stop the work. To difficult to concentrate today. Good training anyway.

2019 Mar 05 10:20:56
Cheav Villa:  :-\ :D _/\_

2019 Mar 05 05:26:54
Johann: A meritful Uposatha those who observe it today.

2019 Mar 05 03:13:37
Johann: Sadhu, Sadhu. Rupam anicca

2019 Mar 05 01:47:58
Ieng Puthy: ករុណាបារម្ភអំពីសុខភាពរបស់ព្រះអង្គពីចំងាយ 🙏🏻ករុណាសូមប្រគេនពរ ព្រះអង្គឆាប់ជាសះស្បេីយ

2019 Mar 05 01:38:52
Ieng Puthy: ករុណាឮថា ព្រះអង្គអាពាធ ជង្គង់ តេីព្រះអង្គបានធូរស្បេីយហេីយឬនៅ?

2019 Mar 05 01:07:09
Ieng Puthy: Good morning ! 🙏🏻🙏🏻🙏🏻 ករុណាថ្វាយបង្គំុ Vandami Bante

2019 Mar 04 14:41:21
Johann: Sukha Chomreoun, Nyom

2019 Mar 04 14:18:03
Moritz: Leaving now. May Bhante have a good day. _/\_

2019 Mar 04 13:45:50
Johann: Nyom Moritz

2019 Mar 04 13:19:43
Moritz: Vandami Bhante _/\_

2019 Mar 01 16:35:39
Johann: Bhante Indannano  _/\_ How was India in "war times"

2019 Feb 28 11:46:56
Moritz: And good bye again. :) _/\_

2019 Feb 28 11:46:38
Moritz: Vandami, Bhante _/\_

2019 Feb 28 05:25:27
Cheav Villa:   :D_/\_

2019 Feb 28 04:22:46
Johann: 5 o'clock morning  ^-^ (like a taxi driver) may Nyom has well earned and peaceful rest. Mudita.

2019 Feb 28 04:08:09
Moritz: Chom reap leah. I'm going to sleep. _/\_

2019 Feb 28 03:40:13
Johann: Sukha chomreoun Nyom.

2019 Feb 28 03:39:06
Moritz: Vandami, Bhante _/\_

2019 Feb 28 03:35:39
Moritz: Good morning, Cheav Villa _/\_

Tipitaka Khmer

 Please feel welcome to join the transcription project of the Tipitaka translation in khmer, and share one of your favorite Sutta or more. Simply click here or visit the Forum: 

Search ATI on ZzE

Zugang zur Einsicht - Schriften aus der Theravada Tradition



Access to Insight / Zugang zur Einsicht: Dhamma-Suche auf mehr als 4000 Webseiten (deutsch / english) - ohne zu googeln, andere Ressourcen zu nehmen, weltliche Verpflichtungen einzugehen. Sie sind für den Zugang zur Einsicht herzlich eingeladen diese Möglichkeit zu nutzen. (Info)

Random Sutta
Random Article
Random Jataka

Zufälliges Sutta
Zufälliger Artikel
Zufälliges Jataka


Arbeits/Work Forum ZzE

"Dhammatalks.org":
[logo dhammatalks.org]
Random Talk
[pic 30]

Zugang zur Einsicht - Übersetzung, Kritik und Anmerkungen

Herzlich Willkommen im Arbeitsforum von zugangzureinsicht.org im Onlinekloster sangham.net!


Danke werte(r) Besucher(in), dass Sie von dieser Möglichkeit Gebrauch machen und sich direkt einbringen wollen.

Unten (wenn Sie etwas scrollen) finden Sie eine Eingabemaske, in der Sie Ihre Eingabe einbringen können. Es stehen Ihnen auch verschiedene Gestaltungsmöglichkeiten zur Verfügung. Wenn Sie einen Text im formatierten Format abspeichern wollen, klicken Sie bitte das kleine Kästchen mit dem Pfeil.

Die Textfelder "Name" und "email" müssen ausgefüllt werden, Sie können hier aber auch eine Anonyme Angabe machen und eine Pseudo-email angeben (geben Sie, wenn Sie Rückantwort haben wollen, jedoch einen Kontakt an), wenn Ihnen das unangenehm ist. Der Name scheint im Forum als Text auf und die Email ist von niemanden außer dem Administrator einsehbar.

Wenn Sie den Text fertig geschrieben haben, müssen Sie noch den Spamschutz überwinden, das Bild zusammen setzen, und dann auf "Vorschau" oder "Senden" drücken, wenn für Sie alles passt.

Wenn Sie eine Spende einer Übersetzung machen wollen, wäre es schön, wenn Sie etwas vom Entstehen bzw. deren Herkunft erzählen und Ihrer Gabe vielleicht noch eine Widmung anhängen.

Gerne, so es möglich ist, werden wir Ihre Übersetzung dann auch den Seiten von Zugang zur Einsicht veröffentlichen. Für generelle Fragen zu dem Umfang der Dhamma-Geschenke auf ZzE sehen Sie bitte in den FAQ von ZzE ein.

Gerne empfangen wir Kritik und selbstverständlich auch Korrekturen oder Anregungen hier. Es steht Ihnen natürlich offen und Sie sind dazu herzlich eingeladen auch direkt mit einem eigenen Zugang hier an den Arbeiten vielleicht direkt teilzunehmen.

Sadhu!

metta & mudita
Ihr Zugang zur Einsicht Team

Um sich im Abeitsforum etwas unzusehen, klicken Sie hier. . Sie finden hier viele Informationen und vielleicht sogar neues rund um Zugang zur Einsicht.

Author Topic: [ATI.eu] CSCD xml to ati.eu format: converting, editing  (Read 5679 times)

0 Members and 1 Guest are viewing this topic.

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #75 on: March 11, 2019, 07:23:36 AM »
Having already found one of an potential content eater, having forgotten to escape dot

<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[. ]*<hi rend=[^\w]dot[^\w]>[. ]*<\/hi>[. ]*([^\n]*)<\/p>[\s]*   <div hangnum><span para #para_$1>[$2]</span></div> $3\n\n
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #76 on: March 12, 2019, 01:23:18 PM »
Status (lokal)

Some files have been re-renamed. Current list: Renaming of source files , renaming files.

Regex-list for xml- to ati-standard as done for "cs-rm", "cs-km", "cs-th", "cs-ru" at once.

Note that {...} strings will be replaced in a later selective session. Replacments are done "single-line" if not other mentioned.

##Starting with the header and footer, which replaces "content".

###HEADER multiline (10792 replacements)

Code: [Select]
[\s]*<\?xml(.+?)<body>[\s]*

<span hide>sources: cs-file name {cs file} path ati{lang}:{ns-section}:{file}</span>\n{{section>en:tech:template_includes#{lang}_header&nouser&nodate&noheader&noeditbutton&firstsectiononly}}\n<div {lang}>\n\n

###FOOTER multiline

Code: [Select]
[\s]*<\/body>(.*)<\/([^p]*?)>[\s]*

\n\n</div>\n{{section>en:tech:template_includes#{lang}_footer&nouser&nodate&noheader&noeditbutton&firstsectiononly}}

###CS-CD ANCHORS

Code: [Select]
<pb ed=[^\w]([^<>]*?)[^\w] n=[^\w]([^<>]*?)[^\w][\s]*\/>

<span anchor #$1_$2></span>

###BOLD

Code: [Select]
<hi rend=[^\w]bold[^\w]>([^\n]+?)<\/hi>

**$1**

###P CENTRE

Code: [Select]
<p rend=[^\w]centre[^\w]>(.*?)<\/p>

<div centeralign>$1</div>

###NOTE

Code: [Select]
<note>([^\n]+?)<\/note>

<span note>$1</span>

###P HI PARANUM DOT

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]dot[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###P HI PARANUM DOT []

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)[\. ]*?<\/hi>[\. ]*?([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P

Code: [Select]
<p rend=[^\w]bodytext[^\w]>([^\n]+?)<\/p>[\s]*

$1\n\n

###P HI PARANUM DOT

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*<hi rend=[^\w]dot[^\w]>[\. ]*<\/hi>[\. ]*([^\n]*)<\/p>[\s]*

<div hangnum><span para #para_$1>[$2]</span></div> $3\n\n

###GATHA

Code: [Select]
<p rend=[^\w]gatha([^<>]*?)[^\w]>([^\n]+)<\/p>[\s]*

<div gatha$1>$2</div>\n\n

###INDENT|UNINDENTED

Code: [Select]
<p rend=[^\w](indent|unindented)[^\w]>([^\n]+)<\/p>[\s]*

<div $1>$2</div>\n\n

###NIKAYA

Code: [Select]
<p rend=[^\w]nikaya[^\w]>([^<>]*?)<\/p>

<div centeralign #nikaya>**$1**</div>\n<span sang_id #{file--}>[[{path-release}:{file--}|{file--}]] | [[{path-source}:{file}#{file--}|source]]</span>

###BOOK 868

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<\/p>

======== $1 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>

###CHAPTER

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<\/p>

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<\/p>

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<\/p>

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###SUBSUBHEAD

Code: [Select]
<p rend=[^\w]subsubhead[^\w]>([^<>]*?)<\/p>

=== $1 ===\n<span sang_id #{file-}.{no+}>[[{path-release}:{file-}.{no+}|{file-}.{no+}]] | [[{path-source}:{file}#{file-}.{no+}|source]]</span>

###SUBHEAD NOTE

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###CHAPTER NOTE

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE NOTE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD ANCHOR

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n<span span anchor #$2</span>

###CHAPTER ANCHOR

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>\n<span span anchor #$2</span>

###TITLE ANCHOR

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>\n<span span anchor #$2</span>

###BOOK ANCHOR

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======== $1$3 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>\n<span span anchor #$2</span>

This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #77 on: March 15, 2019, 10:39:40 AM »
Further edits:

###DOT

Code: [Select]
<hi rend="dot">\.</hi>

.

###HANGUM INTO HEADER (JAT only) multiline

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== $4. $5 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGUM INTO HEADER (JAT only) multiline [..] X.

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ \[([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\] ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== [$4] $5. $6 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGUM INTO HEADER HH (JAT only) multiline

Code: [Select]
======= ([^<>]*?) =======[\r\n]+<span sang_id #\{file\}>\[\[\{path-release\}:\{file\}\|\{file\}\]\] \| \[\[\{path-source\}:\{file\}#\{file\}\|source\]\]<\/span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)[\r\n]+

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>$2==== $3. $4 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGUM INTO HEADER HH (JAT only) no NO.

Code: [Select]
<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([^\n]*?)[\r\n]+

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGNUM CORR (exception in bud-vgs.nk.2_any.txt and sut.sn.01.txt!!)

Code: [Select]
<p rend=[^\w]hangnum[^\w]>([^<>]+?)\.<\/p>

<div hangnum>$1.</div>

###Search "<p rend=[^\w]hangnum[^\w]>" further 47 hits in 39 files: best made one by one since many exceptions.

###BOLD corrections

without regex:

Code: [Select]
]</span> .

]</span>

correction before, ###HANGNUM, again

###P HANGNUM HI PARANUM BOLD

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]bold[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###further <hi rend="bold"> corr. are made on the single pages

###P HANGNUM HI PARANUM

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<\/p>[\s]*

<span para #para_$1>[$2]</span>\n\n

###P INTENT PARANUM

Code: [Select]
<p rend=[^\w]indent[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>\. ([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P HANGNUM HI PARANUM content

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###GATHA PARANUM

Code: [Select]
<div gatha1[^\w] n=[^\w]([0-9]*?)><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*([^\n]*?)</div>

<span para #para_$1>[$2]</span>\n\n<div gatha1>$3</div>

###Correction

Code: [Select]
<div gatha2" n="-><hi rend="paranum">-</hi>

<div gatha2>

###Manual corrections for all matches of "<p rend"

###Cleanings

Code: [Select]
\r\n

\n

There might be further xml-tags left and small edits needed, but those can be made online.

Atma will now replace the placeholder (except {no}, {no+}) where he has no idea of how to process that right and effective for now, and then upload all files anew.

(Note: working/processing on replacements with batchedit online is much faster as with notepad++ local (about a 3-4 days). Of course the cleaning of cache and delete of history online takes the also a good while.)
« Last Edit: March 15, 2019, 10:44:41 AM by Johann »
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #78 on: March 15, 2019, 10:58:27 AM »
And using Powershell such as ((Get-Content vin.par.ve.txt -Raw) -replace '{lang}','cs-km') | Set-Content vin.par.ve.txt destroyed the files, possible a utf-8-issue... (and having not made a backup...)

all once again  ^-^ :)
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #79 on: March 16, 2019, 01:02:54 AM »
Atma will upload the renamed files with original content and try again to make the replacements online with batchedit, since having come across that Notepad sometimes loses found matches and gives nothing back when replacing.
In this way, at least, the originals would be stored on ati as well. Lets see whether web-space and sun allows it the next days.
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #80 on: March 16, 2019, 04:25:30 PM »
Files are all anew uploaded so far. The Khmer files need some rest replacements of xml codes. Renamed files have been deleted.

Once the index is rebuild, the last replacements can be made.

As for the replacements of the placeholder {file}, {ns-section}... it's maybe good if runing similar scripts on the server.

In regard of {no}: no over all idea for now, so maybe good as before.

Attached an excel-list containing all particular replacements for each single file.
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #81 on: March 17, 2019, 11:41:59 AM »
List of renaming of the index files (toc.xml): renaming_files#index-files_toc
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #82 on: March 23, 2019, 11:45:38 AM »
Main indexes in the four scripts should be fine and complete now:

Tipiṭaka (Roman)
តិបិដក (បាឡិ​ខ្មែរ) ติปิฎก (Thai) д̇ибидага (кириллица)
My person currently ties to rebuild the index by actualization option, which actually seems to be double slower as to build anew, but possible would not aim in no index when stopping in between (about 3000 pages of 20500 indexed since this morning)
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +360/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
After 48h 9600 pages where indexed, then the energy supply failed because of clouds and the connection as well. Atma may try it later again.

Since cool whether, it's possible good to work on giving conditions here by "finishing" certain buildings and dwellings the next time, with that what has been given or left behind.
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Tags: