Author Topic: [ATI.eu] CSCD xml to ati.eu format: converting, editing  (Read 11076 times)

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +366/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #75 on: March 11, 2019, 07:23:36 AM »
Already found one potential content eater: having forgotten to escape the dot.

<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[. ]*<hi rend=[^\w]dot[^\w]>[. ]*<\/hi>[. ]*([^\n]*)<\/p>[\s]*

<div hangnum><span para #para_$1>[$2]</span></div> $3\n\n
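A small check of the dot behaviour, as a sketch in Python only (an assumption: the rules in this thread were run in Notepad++ and batchedit, whose regex flavours may differ in details). Outside a character class an unescaped dot matches any character and can eat content; inside [...] the dot is already literal, so [. ] and [\. ] should behave the same. The sample strings are made up for illustration.

```python
import re

sample = "12a text"  # hypothetical input, not taken from the CSCD files

# Unescaped dot outside a class: matches ANY character, so the "a" is eaten.
eaten = re.sub(r"^(\d+).", r"[\1]", sample)

# Escaped dot: only a literal "." would match, so this sample stays untouched.
kept = re.sub(r"^(\d+)\.", r"[\1]", sample)

# Inside a character class the dot is literal with or without the backslash.
same = re.sub(r"[. ]+", "", "1. 2") == re.sub(r"[\. ]+", "", "1. 2")

print(eaten)
print(kept)
print(same)
```

So the risk sits with dots outside of square brackets; the bracketed ones are harmless either way.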
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

« Reply #76 on: March 12, 2019, 01:23:18 PM »
Status (local)

Some files have been renamed again. Current list: Renaming of source files, renaming files.

Regex list for converting XML to the ati standard, as applied to "cs-rm", "cs-km", "cs-th" and "cs-ru" at once.

Note that the {...} strings will be replaced in a later, selective session. Replacements are done "single-line" unless otherwise mentioned.

##Starting with the header and footer, which replaces "content".

###HEADER multiline (10792 replacements)

Code: [Select]
[\s]*<\?xml(.+?)<body>[\s]*

<span hide>sources: cs-file name {cs file} path ati{lang}:{ns-section}:{file}</span>\n{{section>en:tech:template_includes#{lang}_header&nouser&nodate&noheader&noeditbutton&firstsectiononly}}\n<div {lang}>\n\n
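For trying the HEADER rule outside of Notepad++/batchedit, here is a minimal sketch in Python. It assumes that "multiline" above means "the dot also matches line breaks" (re.DOTALL in Python); the sample XML head is made up and much shorter than a real CSCD file.

```python
import re

# HEADER rule from above; DOTALL lets (.+?) run across line breaks.
HEADER = re.compile(r"[\s]*<\?xml(.+?)<body>[\s]*", re.DOTALL)

HEADER_REPL = (
    "<span hide>sources: cs-file name {cs file} path ati{lang}:{ns-section}:{file}</span>\n"
    "{{section>en:tech:template_includes#{lang}_header"
    "&nouser&nodate&noheader&noeditbutton&firstsectiononly}}\n"
    "<div {lang}>\n\n"
)

def replace_header(text):
    # A function is used as replacement so that nothing in HEADER_REPL
    # is interpreted as a group reference or escape sequence.
    return HEADER.sub(lambda match: HEADER_REPL, text, count=1)

# Hypothetical, shortened file head for illustration only.
sample = '<?xml version="1.0"?>\n<TEI.2>\n<text>\n<body>\n<p rend="bodytext">x</p>'
converted = replace_header(sample)
print(converted)
```

The placeholders ({lang}, {file}, ...) are left in place on purpose, as in the post, to be filled in a later session.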

###FOOTER multiline

Code: [Select]
[\s]*<\/body>(.*)<\/([^p]*?)>[\s]*

\n\n</div>\n{{section>en:tech:template_includes#{lang}_footer&nouser&nodate&noheader&noeditbutton&firstsectiononly}}

###CS-CD ANCHORS

Code: [Select]
<pb ed=[^\w]([^<>]*?)[^\w] n=[^\w]([^<>]*?)[^\w][\s]*\/>

<span anchor #$1_$2></span>
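The anchor rule can be checked on a single tag like this (a sketch in Python; the replacement uses \1 and \2 where the list above writes $1 and $2, and the sample tag is hypothetical):

```python
import re

# CS-CD ANCHORS rule from above; [^\w] stands in for either quote character.
PB = re.compile(r"<pb ed=[^\w]([^<>]*?)[^\w] n=[^\w]([^<>]*?)[^\w][\s]*\/>")

def convert_anchors(text):
    # Edition letter and page number are joined into one anchor id.
    return PB.sub(r"<span anchor #\1_\2></span>", text)

# Hypothetical page-break tag for illustration.
anchor = convert_anchors('<pb ed="V" n="0.0001" />')
print(anchor)
```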

###BOLD

Code: [Select]
<hi rend=[^\w]bold[^\w]>([^\n]+?)<\/hi>

**$1**

###P CENTRE

Code: [Select]
<p rend=[^\w]centre[^\w]>(.*?)<\/p>

<div centeralign>$1</div>

###NOTE

Code: [Select]
<note>([^\n]+?)<\/note>

<span note>$1</span>
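To run several rules of this list in sequence with a script, something like the following could serve (a sketch, not what was actually used here; apply_rules and to_python_repl are hypothetical helper names). Python writes group references as \g<1> instead of $1, so the replacement strings are translated first.

```python
import re

def to_python_repl(repl):
    # Turn "$1"-style group references into Python's "\g<1>" form.
    return re.sub(r"\$(\d+)", r"\\g<\1>", repl)

def apply_rules(text, rules):
    # Rules are applied in list order; each one sees the previous result.
    for pattern, repl in rules:
        text = re.sub(pattern, to_python_repl(repl), text)
    return text

# The BOLD and NOTE rules exactly as listed above.
RULES = [
    (r"<hi rend=[^\w]bold[^\w]>([^\n]+?)<\/hi>", "**$1**"),
    (r"<note>([^\n]+?)<\/note>", "<span note>$1</span>"),
]

# Hypothetical sample line for illustration.
result = apply_rules('<hi rend="bold">Namo</hi> tassa <note>sī.</note>', RULES)
print(result)
```

Keeping the rules in one ordered list also documents the order, which matters for the later, more general rules.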

###P HI PARANUM DOT

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]dot[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n
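A quick test of this paragraph-number rule on one line (a Python sketch; the sample paragraph is made up to follow the tag layout the pattern expects):

```python
import re

# P HI PARANUM DOT rule from above, split over three lines for readability.
PARANUM_DOT = re.compile(
    r"<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*"
    r"<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*"
    r"<hi rend=[^\w]dot[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*"
)

# \1 = n attribute, \2 = printed number, \3 = rest of the paragraph.
PARANUM_REPL = r"<span para #para_\1>[\2]</span>\3\n\n"

def convert_paranum(text):
    return PARANUM_DOT.sub(PARANUM_REPL, text)

# Hypothetical sample paragraph.
sample = (
    '<p rend="bodytext" n="1"><hi rend="paranum">1</hi>'
    '<hi rend="dot">.</hi> Evaṃ me sutaṃ.</p>\n'
)
converted = convert_paranum(sample)
print(converted)
```

Since the dot here is escaped and sits inside its own <hi> tag, only the literal dot is dropped and the text after it is kept.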

###P HI PARANUM DOT []

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)[\. ]*?<\/hi>[\. ]*?([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P

Code: [Select]
<p rend=[^\w]bodytext[^\w]>([^\n]+?)<\/p>[\s]*

$1\n\n

###P HANGNUM HI PARANUM DOT

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*<hi rend=[^\w]dot[^\w]>[\. ]*<\/hi>[\. ]*([^\n]*)<\/p>[\s]*

<div hangnum><span para #para_$1>[$2]</span></div> $3\n\n

###GATHA

Code: [Select]
<p rend=[^\w]gatha([^<>]*?)[^\w]>([^\n]+)<\/p>[\s]*

<div gatha$1>$2</div>\n\n
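The GATHA rule keeps whatever follows "gatha" in the rend value (gatha1, gatha2, gathalast, ...) as part of the div class. A sketch in Python with two made-up verse lines:

```python
import re

# GATHA rule from above; group 1 is the suffix after "gatha", group 2 the line.
GATHA = re.compile(r"<p rend=[^\w]gatha([^<>]*?)[^\w]>([^\n]+)<\/p>[\s]*")

def convert_gatha(text):
    return GATHA.sub(r"<div gatha\1>\2</div>\n\n", text)

# Hypothetical verse lines, one <p> per line as the rule assumes.
sample = (
    '<p rend="gatha1">line one,</p>\n'
    '<p rend="gathalast">line two.</p>\n'
)
verses = convert_gatha(sample)
print(verses)
```

Note that ([^\n]+) is greedy: if two <p> elements ever share one line, they would be merged into one div, so the rule relies on one element per line.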

###INDENT|UNINDENTED

Code: [Select]
<p rend=[^\w](indent|unindented)[^\w]>([^\n]+)<\/p>[\s]*

<div $1>$2</div>\n\n

###NIKAYA

Code: [Select]
<p rend=[^\w]nikaya[^\w]>([^<>]*?)<\/p>

<div centeralign #nikaya>**$1**</div>\n<span sang_id #{file--}>[[{path-release}:{file--}|{file--}]] | [[{path-source}:{file}#{file--}|source]]</span>

###BOOK 868

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<\/p>

======== $1 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>

###CHAPTER

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<\/p>

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<\/p>

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<\/p>

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###SUBSUBHEAD

Code: [Select]
<p rend=[^\w]subsubhead[^\w]>([^<>]*?)<\/p>

=== $1 ===\n<span sang_id #{file-}.{no+}>[[{path-release}:{file-}.{no+}|{file-}.{no+}]] | [[{path-source}:{file}#{file-}.{no+}|source]]</span>

###SUBHEAD NOTE

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###CHAPTER NOTE

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE NOTE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD ANCHOR

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n<span anchor #$2</span>

###CHAPTER ANCHOR

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>\n<span anchor #$2</span>

###TITLE ANCHOR

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>\n<span anchor #$2</span>

###BOOK ANCHOR

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======== $1$3 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>\n<span anchor #$2</span>

« Reply #77 on: March 15, 2019, 10:39:40 AM »
Further edits:

###DOT

Code: [Select]
<hi rend="dot">\.</hi>

.

###HANGNUM INTO HEADER (JAT only) multiline

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== $4. $5 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGNUM INTO HEADER (JAT only) multiline [..] X.

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ \[([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\] ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== [$4] $5. $6 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGNUM INTO HEADER HH (JAT only) multiline

Code: [Select]
======= ([^<>]*?) =======[\r\n]+<span sang_id #\{file\}>\[\[\{path-release\}:\{file\}\|\{file\}\]\] \| \[\[\{path-source\}:\{file\}#\{file\}\|source\]\]<\/span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)[\r\n]+

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>$2==== $3. $4 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGNUM INTO HEADER HH (JAT only) no NO.

Code: [Select]
<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([^\n]*?)[\r\n]+

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGNUM CORR (exception in bud-vgs.nk.2_any.txt and sut.sn.01.txt!!)

Code: [Select]
<p rend=[^\w]hangnum[^\w]>([^<>]+?)\.<\/p>

<div hangnum>$1.</div>

###Search "<p rend=[^\w]hangnum[^\w]>" further 47 hits in 39 files: best made one by one since many exceptions.

###BOLD corrections

without regex:

Code: [Select]
]</span> .

]</span>

then the earlier correction, ###HANGNUM, once again

###P HANGNUM HI PARANUM BOLD

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]bold[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###further <hi rend="bold"> corr. are made on the single pages

###P HANGNUM HI PARANUM

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<\/p>[\s]*

<span para #para_$1>[$2]</span>\n\n

###P INDENT PARANUM

Code: [Select]
<p rend=[^\w]indent[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>\. ([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P HANGNUM HI PARANUM content

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###GATHA PARANUM

Code: [Select]
<div gatha1[^\w] n=[^\w]([0-9]*?)><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*([^\n]*?)</div>

<span para #para_$1>[$2]</span>\n\n<div gatha1>$3</div>

###Correction

Code: [Select]
<div gatha2" n="-><hi rend="paranum">-</hi>

<div gatha2>

###Manual corrections for all matches of "<p rend"

###Cleanings

Code: [Select]
\r\n

\n

There might be further XML tags left and small edits needed, but those can be made online.

Atma will now replace the placeholders (except {no} and {no+}, where he has no idea for now how to process them correctly and effectively), and then upload all files anew.

(Note: processing the replacements with batchedit online is much faster than with Notepad++ locally (about 3-4 days). Of course, cleaning the cache and deleting the history online also takes a good while.)
« Last Edit: March 15, 2019, 10:44:41 AM by Johann »

« Reply #78 on: March 15, 2019, 10:58:27 AM »
And using PowerShell, such as ((Get-Content vin.par.ve.txt -Raw) -replace '{lang}','cs-km') | Set-Content vin.par.ve.txt, destroyed the files, possibly a UTF-8 issue... (and no backup had been made...)

all once again  ^-^ :)
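One likely cause (an assumption, not verified against the destroyed files): Windows PowerShell 5.1 reads and writes with the system's ANSI code page unless -Encoding is given, which breaks Khmer, Thai and Cyrillic text stored as UTF-8 without BOM. A sketch of the same replacement in Python, which decodes and encodes explicitly as UTF-8 and makes a backup first; replace_placeholder and the demo file are hypothetical:

```python
import shutil
import tempfile
from pathlib import Path

def replace_placeholder(path, placeholder, value):
    """Replace a literal placeholder in a UTF-8 file, keeping a .bak copy."""
    path = Path(path)
    backup = path.with_name(path.name + ".bak")
    shutil.copy2(path, backup)  # backup before touching anything
    # Work on bytes so that line endings are not changed along the way.
    text = path.read_bytes().decode("utf-8")
    hits = text.count(placeholder)
    path.write_bytes(text.replace(placeholder, value).encode("utf-8"))
    return hits

# Demo on a throw-away file with Khmer content.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "demo.txt"
    demo.write_bytes("<div {lang}>តិបិដក</div>".encode("utf-8"))
    hits = replace_placeholder(demo, "{lang}", "cs-km")
    result = demo.read_bytes().decode("utf-8")
    backup_exists = demo.with_name("demo.txt.bak").exists()

print(hits, result, backup_exists)
```

The replacement here is a plain string replacement, not a regex, so the braces in {lang} need no escaping.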

« Reply #79 on: March 16, 2019, 01:02:54 AM »
Atma will upload the renamed files with their original content and try again to make the replacements online with batchedit, having come across that Notepad sometimes loses found matches and gives nothing back when replacing.
In this way, at least, the originals would be stored on ati as well. Let's see whether web space and sun allow it over the next days.

« Reply #80 on: March 16, 2019, 04:25:30 PM »
All files have been uploaded anew so far. The Khmer files still need some remaining replacements of XML codes. The renamed files have been deleted.

Once the index is rebuilt, the last replacements can be made.

As for the replacements of the placeholders {file}, {ns-section}...: it may be good to run similar scripts on the server.

Regarding {no}: no overall idea for now, so maybe good as before.

Attached is an Excel list containing all particular replacements for each single file.

« Reply #81 on: March 17, 2019, 11:41:59 AM »
List of renaming of the index files (toc.xml): renaming_files#index-files_toc

« Reply #82 on: March 23, 2019, 11:45:38 AM »
Main indexes in the four scripts should be fine and complete now:

Tipiṭaka (Roman)
តិបិដក (បាឡិ​ខ្មែរ)
ติปิฎก (Thai)
д̇ибидага (кириллица)

My person is currently trying to rebuild the index with the update option, which actually seems to be twice as slow as building it anew, but possibly would not end up with no index at all when stopped in between (about 3000 of 20500 pages indexed since this morning).

« Reply #83 on: April 01, 2019, 12:23:19 PM »

One or more posts were split off from this topic and opened as a new topic, "[ati.eu] Indexing, search engine".
« Last Edit: April 01, 2019, 12:29:19 PM by Johann »

« Reply #84 on: April 02, 2019, 08:18:28 AM »
{lang} and {ns-section} have now been replaced on all pages except the 416 pages in cs-th (Thai: 268 pages in Atthakatha and 148 pages in Tika).

The further replacements ({file}, {path-source}...) could be made according to the list above, either page by page or with a script using the list. {file+}/{file-} etc., however, may need further rendering later. {no}...: the same.
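A script using the list could look roughly like this (a sketch in Python; fill_placeholders is a hypothetical name, and the values for {file} and {path-source} are invented for illustration, the real ones would come from the Excel list):

```python
def fill_placeholders(text, values):
    # Replace each known {name} literally; unknown placeholders stay as they are.
    for name, value in values.items():
        text = text.replace("{" + name + "}", value)
    return text

# Hypothetical values for one page; real values would be read from the list.
values = {
    "file": "vin.par.ve",
    "path-source": "cs-km:tipitaka:vin",
}

line = "[[{path-source}:{file}#{file}|source]] #{file-}.{no}"
filled = fill_placeholders(line, values)
print(filled)
```

Because the braces are part of the search string, {file} does not touch {file-}, {file+} or {no}, which can then be handled in a later pass.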

Sadhu for the great work and assistance of many in bringing the first four languages in here and making them available for the Sangha and those with Nissaya.

Atma will look after the last XML conversions into ati syntax in the Khmer pages and then look after the CSS for "good" layouts.

An Excel file that helps with creating the release files, also for languages to come, can be used: renaming_list.xlsx. To extract them into directories and files for an upload, the "Converting lists into txt-files - Tools for Ati.eu" can be used.
« Last Edit: April 07, 2019, 01:24:01 PM by Johann »

« Reply #85 on: April 11, 2019, 12:38:23 PM »
Currently working on the "single-sutta release" files, which may take some time given about 40,000 headers, but would then also finally give values for the {no..} replacements (for links to them) in the source files.

Since making single files for Atthakatha and Tika would produce a huge number of files, unless they are skipped, Atma thought of embedding the related commentaries directly in the Sutta (Mula) files.