Author Topic: [ATI.eu] CSCD xml to ati.eu format: converting, editing  (Read 9758 times)

Offline Johann

  • Samanera
  • Very Engaged Member
  • *
  • Sadhu! or +362/-0
  • Gender: Male
  • Date of ordination/Datum der Ordination.: 20140527
Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #75 on: March 11, 2019, 07:23:36 AM »
Already found one potential content eater: having forgotten to escape the dot.

<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[. ]*<hi rend=[^\w]dot[^\w]>[. ]*<\/hi>[. ]*([^\n]*)<\/p>[\s]*

<div hangnum><span para #para_$1>[$2]</span></div> $3\n\n
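Why an unescaped dot can eat content, as a minimal Python sketch. The sample string and the shortened pattern are made up for illustration, and Python writes back-references as \1 where the list here uses $1:

```python
import re

# Hypothetical sample: a paragraph number directly followed by text.
sample = '<hi rend="paranum">12</hi>x text'

# Unescaped: "." matches ANY character, so the "x" is swallowed.
eaten = re.sub(r'<hi rend="paranum">(\d+)</hi>.', r'[\1]', sample)

# Escaped: only a literal dot may follow, so this sample stays untouched.
safe = re.sub(r'<hi rend="paranum">(\d+)</hi>\.', r'[\1]', sample)

print(eaten)  # [12] text
print(safe == sample)  # True
```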
This post and Content has come to be by Dhamma-Dana and so is given as it       Dhamma-Dana: Johann

Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #76 on: March 12, 2019, 01:23:18 PM »
Status (local)

Some files have been renamed again. Current list: Renaming of source files, renaming files.

Regex list for converting XML to the ati standard, as applied to "cs-rm", "cs-km", "cs-th" and "cs-ru" at once.

Note that {...} placeholder strings will be replaced in a later, selective pass. Replacements are done "single-line" unless stated otherwise.

##Starting with the header and footer, which replaces "content".

###HEADER multiline (10792 replacements)

Code: [Select]
[\s]*<\?xml(.+?)<body>[\s]*

<span hide>sources: cs-file name {cs file} path ati{lang}:{ns-section}:{file}</span>\n{{section>en:tech:template_includes#{lang}_header&nouser&nodate&noheader&noeditbutton&firstsectiononly}}\n<div {lang}>\n\n

###FOOTER multiline

Code: [Select]
[\s]*<\/body>(.*)<\/([^p]*?)>[\s]*

\n\n</div>\n{{section>en:tech:template_includes#{lang}_footer&nouser&nodate&noheader&noeditbutton&firstsectiononly}}
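A rough Python sketch of what "multiline" means for the header and footer rules: the dot has to cross line breaks, which in Python is the re.DOTALL flag. The sample XML and the shortened HEADER/FOOTER strings are assumptions for illustration, not the real templates:

```python
import re

# Made-up miniature source file.
xml = ('<?xml version="1.0"?>\n<TEI.2>\n<text>\n<body>\n'
       '<p rend="bodytext">Evaṃ me sutaṃ.</p>\n'
       '</body>\n</text>\n</TEI.2>\n')

# Shortened stand-ins for the real header/footer replacements (assumption).
HEADER = '<div {lang}>\n\n'
FOOTER = '\n\n</div>'

# re.DOTALL lets "." match line breaks; a function as replacement avoids
# any escape processing of the replacement text.
body = re.sub(r'[\s]*<\?xml(.+?)<body>[\s]*', lambda m: HEADER, xml, flags=re.DOTALL)
body = re.sub(r'[\s]*<\/body>(.*)<\/([^p]*?)>[\s]*', lambda m: FOOTER, body, flags=re.DOTALL)
print(body)
```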

###CS-CD ANCHORS

Code: [Select]
<pb ed=[^\w]([^<>]*?)[^\w] n=[^\w]([^<>]*?)[^\w][\s]*\/>

<span anchor #$1_$2></span>

###BOLD

Code: [Select]
<hi rend=[^\w]bold[^\w]>([^\n]+?)<\/hi>

**$1**
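The anchor and bold rules above, tried out as a small Python sketch. The patterns are copied from the list, only "$1" becomes "\1" for Python; the sample line is invented:

```python
import re

# (pattern, replacement) pairs transcribed from the list above.
RULES = [
    (r'<pb ed=[^\w]([^<>]*?)[^\w] n=[^\w]([^<>]*?)[^\w][\s]*\/>',
     r'<span anchor #\1_\2></span>'),
    (r'<hi rend=[^\w]bold[^\w]>([^\n]+?)<\/hi>', r'**\1**'),
]

def convert(text: str) -> str:
    """Apply every rule in order to the whole text."""
    for pattern, replacement in RULES:
        text = re.sub(pattern, replacement, text)
    return text

sample = '<pb ed="P" n="1.0001" /><hi rend="bold">Namo tassa</hi>'
converted = convert(sample)
print(converted)  # <span anchor #P_1.0001></span>**Namo tassa**
```

Keeping the rules in one ordered list makes it easy to rerun the whole conversion after a correction.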

###P CENTRE

Code: [Select]
<p rend=[^\w]centre[^\w]>(.*?)<\/p>

<div centeralign>$1</div>

###NOTE

Code: [Select]
<note>([^\n]+?)<\/note>

<span note>$1</span>

###P HI PARANUM DOT

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]dot[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###P HI PARANUM DOT []

Code: [Select]
<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)[\. ]*?<\/hi>[\. ]*?([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P

Code: [Select]
<p rend=[^\w]bodytext[^\w]>([^\n]+?)<\/p>[\s]*

$1\n\n
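How the numbered and the plain paragraph rule work together, as a Python sketch. The two patterns are taken from the list (first P HI PARANUM DOT rule and the P rule); the sample text is invented:

```python
import re

RULES = [
    # numbered paragraph: keep the number as a para anchor
    (r'<p rend=[^\w]bodytext[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*'
     r'<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*'
     r'<hi rend=[^\w]dot[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*',
     r'<span para #para_\1>[\2]</span>\3\n\n'),
    # plain paragraph: just unwrap the content
    (r'<p rend=[^\w]bodytext[^\w]>([^\n]+?)<\/p>[\s]*', r'\1\n\n'),
]

sample = (
    '<p rend="bodytext" n="1"><hi rend="paranum">1</hi>'
    '<hi rend="dot">.</hi> Evaṃ me sutaṃ.</p>\n'
    '<p rend="bodytext">Ekaṃ samayaṃ bhagavā.</p>\n'
)

converted = sample
for pattern, replacement in RULES:
    converted = re.sub(pattern, replacement, converted)
print(converted)
```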

###P HANGNUM HI PARANUM DOT

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*<hi rend=[^\w]dot[^\w]>[\. ]*<\/hi>[\. ]*([^\n]*)<\/p>[\s]*

<div hangnum><span para #para_$1>[$2]</span></div> $3\n\n

###GATHA

Code: [Select]
<p rend=[^\w]gatha([^<>]*?)[^\w]>([^\n]+)<\/p>[\s]*

<div gatha$1>$2</div>\n\n

###INDENT|UNINDENTED

Code: [Select]
<p rend=[^\w](indent|unindented)[^\w]>([^\n]+)<\/p>[\s]*

<div $1>$2</div>\n\n

###NIKAYA

Code: [Select]
<p rend=[^\w]nikaya[^\w]>([^<>]*?)<\/p>

<div centeralign #nikaya>**$1**</div>\n<span sang_id #{file--}>[[{path-release}:{file--}|{file--}]] | [[{path-source}:{file}#{file--}|source]]</span>

###BOOK 868

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<\/p>

======== $1 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>

###CHAPTER

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<\/p>

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<\/p>

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<\/p>

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###SUBSUBHEAD

Code: [Select]
<p rend=[^\w]subsubhead[^\w]>([^<>]*?)<\/p>

=== $1 ===\n<span sang_id #{file-}.{no+}>[[{path-release}:{file-}.{no+}|{file-}.{no+}]] | [[{path-source}:{file}#{file-}.{no+}|source]]</span>
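The heading rules from BOOK to SUBSUBHEAD differ only in the number of "=" signs, so they could also be driven by one table. A Python sketch under that assumption; it leaves out the sang_id span with its placeholders, and the sample lines are invented:

```python
import re

# Number of "=" per rend value, as in the rules above.
LEVELS = {'book': 8, 'chapter': 7, 'title': 5, 'subhead': 4, 'subsubhead': 3}

PATTERN = re.compile(
    r'<p rend=[^\w](book|chapter|title|subhead|subsubhead)[^\w]>([^<>]*?)<\/p>'
)

def heading(match: re.Match) -> str:
    """Build the heading line for one matched <p rend=...> element."""
    marks = '=' * LEVELS[match.group(1)]
    return f'{marks} {match.group(2)} {marks}'

sample = ('<p rend="chapter">1. Brahmajālasuttaṃ</p>\n'
          '<p rend="subhead">Paribbājakakathā</p>')
converted = PATTERN.sub(heading, sample)
print(converted)
```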

###SUBHEAD NOTE

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>

###CHAPTER NOTE

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>

###TITLE NOTE

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span note>([^<>]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<div centeralign>**$1<span note>$2</span>$3**</div>\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>

###SUBHEAD ANCHOR

Code: [Select]
<p rend=[^\w]subhead[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

==== $1$3 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n<span anchor #$2</span>

###CHAPTER ANCHOR

Code: [Select]
<p rend=[^\w]chapter[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======= $1$3 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>\n<span anchor #$2</span>

###TITLE ANCHOR

Code: [Select]
<p rend=[^\w]title[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

===== $1$3 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>\n<span anchor #$2</span>

###BOOK ANCHOR

Code: [Select]
<p rend=[^\w]book[^\w]>([^<>]*?)<span anchor #([^\n]*?)<\/span>([^<>]*?)<\/p>

======== $1$3 ========\n<span sang_id #{file-}>[[{path-release}:{file-}|{file-}]] | [[{path-source}:{file}#{file-}|source]]</span>\n<span anchor #$2</span>

Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #77 on: March 15, 2019, 10:39:40 AM »
Further edits:

###DOT

Code: [Select]
<hi rend="dot">\.</hi>

.

###HANGNUM INTO HEADER (JAT only) multiline

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== $4. $5 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n
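The digit class in the rule above lists Khmer, ASCII and Thai digits one by one. In Python 3, \d on text patterns already matches any Unicode decimal digit, so a shorter class would do there. A sketch with invented sample lines; other regex engines may treat \d differently, so this is an assumption about Python only:

```python
import re

# [\d\-] stands in for the spelled-out digit class; re.MULTILINE makes
# ^ and $ work per line.
NUMBERED = re.compile(r'^ ([\d\-]+)\. ([^\n]*)$', re.MULTILINE)

sample = ' 1. Apaṇṇakajātakaṃ\n ២. Vaṇṇupathajātakaṃ\n ๓. Serivavāṇijajātakaṃ'

numbers = []
for number, title in NUMBERED.findall(sample):
    numbers.append(number)
    print(f'==== {number}. {title} ====')
```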

###HANGNUM INTO HEADER (JAT only) multiline [..] X.

Code: [Select]
===== ([^<>]*?) =====\n<span sang_id #([^\n]*?)</span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ \[([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\] ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)\n

===== $1 =====\n<span sang_id #{file+}>[[{path-release}:{file+}|{file+}]] | [[{path-source}:{file}#{file+}|source]]</span>$3==== [$4] $5. $6 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n

###HANGNUM INTO HEADER HH (JAT only) multiline

Code: [Select]
======= ([^<>]*?) =======[\r\n]+<span sang_id #\{file\}>\[\[\{path-release\}:\{file\}\|\{file\}\]\] \| \[\[\{path-source\}:\{file\}#\{file\}\|source\]\]<\/span>(.*?)<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([១២៣៤៥៦៧៨៩០1234567890๑๒๓๔๕๖๗๘๙๐\-]+)\. ([^\n]*?)[\r\n]+

======= $1 =======\n<span sang_id #{file}>[[{path-release}:{file}|{file}]] | [[{path-source}:{file}#{file}|source]]</span>$2==== $3. $4 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGNUM INTO HEADER HH (JAT only) no NO.

Code: [Select]
<p rend=[^\w]hangnum[^\w]>[\s]*<\/p>[\r\n]+ ([^\n]*?)[\r\n]+

==== $1 ====\n<span sang_id #{file-}.{no}>[[{path-release}:{file-}.{no}|{file-}.{no}]] | [[{path-source}:{file}#{file-}.{no}|source]]</span>\n\n

###HANGNUM CORR (exception in bud-vgs.nk.2_any.txt and sut.sn.01.txt!!)

Code: [Select]
<p rend=[^\w]hangnum[^\w]>([^<>]+?)\.<\/p>

<div hangnum>$1.</div>

###Search "<p rend=[^\w]hangnum[^\w]>": a further 47 hits in 39 files, best handled one by one since there are many exceptions.

###BOLD corrections

without regex:

Code: [Select]
]</span> .

]</span>

Then the earlier correction, ###HANGNUM, once again.

###P HANGNUM HI PARANUM BOLD

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<hi rend=[^\w]bold[^\w]>\.<\/hi>([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###further <hi rend="bold"> corr. are made on the single pages

###P HANGNUM HI PARANUM

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\s]*<\/p>[\s]*

<span para #para_$1>[$2]</span>\n\n

###P INDENT PARANUM

Code: [Select]
<p rend=[^\w]indent[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>\. ([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span> $3\n\n

###P HANGNUM HI PARANUM content

Code: [Select]
<p rend=[^\w]hangnum[^\w] n=[^\w]([^<>]*?)[^\w]>[\s]*<hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]([^\n]*?)<\/p>[\s]*

<span para #para_$1>[$2]</span>$3\n\n

###GATHA PARANUM

Code: [Select]
<div gatha1[^\w] n=[^\w]([0-9]*?)><hi rend=[^\w]paranum[^\w]>([^<>]*?)<\/hi>[\. ]*([^\n]*?)</div>

<span para #para_$1>[$2]</span>\n\n<div gatha1>$3</div>

###Correction

Code: [Select]
<div gatha2" n="-><hi rend="paranum">-</hi>

<div gatha2>

###Manual corrections for all matches of "<p rend"

###Cleanings

Code: [Select]
\r\n

\n

There might be further xml-tags left and small edits needed, but those can be made online.

Atma will now replace the placeholders (except {no} and {no+}, for which there is no idea yet of how to process them correctly and efficiently) and then upload all files anew.

(Note: processing the replacements online with batchedit is much faster than locally with Notepad++ (about 3-4 days). Of course, clearing the cache and deleting the history online also takes a good while.)
« Last Edit: March 15, 2019, 10:44:41 AM by Johann »
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #78 on: March 15, 2019, 10:58:27 AM »
And using PowerShell, such as ((Get-Content vin.par.ve.txt -Raw) -replace '{lang}','cs-km') | Set-Content vin.par.ve.txt, destroyed the files, possibly a UTF-8 issue... (and no backup had been made...)

all once again  ^-^ :)
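One likely cause, stated with caution: in Windows PowerShell 5.1, Get-Content and Set-Content fall back to the system's ANSI code page unless -Encoding UTF8 is given, which mangles Khmer or Thai text. A Python sketch of the same placeholder replacement with explicit UTF-8 and a backup copy; the function name and the .bak suffix are assumptions for illustration:

```python
import shutil
import tempfile
from pathlib import Path

def replace_placeholders(path: Path, values: dict) -> None:
    """Replace {key} placeholders in a UTF-8 file, keeping a .bak copy."""
    shutil.copy2(path, path.with_name(path.name + '.bak'))  # backup first
    text = path.read_text(encoding='utf-8')
    for key, value in values.items():
        text = text.replace('{' + key + '}', value)  # literal, no regex
    path.write_text(text, encoding='utf-8')

# Small demonstration in a temporary directory.
khmer = 'ធម្ម'
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / 'vin.par.ve.txt'
    demo.write_text('<div {lang}>\n' + khmer + '\n</div>', encoding='utf-8')
    replace_placeholders(demo, {'lang': 'cs-km'})
    result = demo.read_text(encoding='utf-8')
    backup_exists = (Path(tmp) / 'vin.par.ve.txt.bak').exists()
print(result)
```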
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #79 on: March 16, 2019, 01:02:54 AM »
Atma will upload the renamed files with their original content and try again to make the replacements online with batchedit, having found that Notepad++ sometimes loses found matches and gives nothing back when replacing.
In this way, at least, the originals would be stored on ati as well. Let's see whether web space and sun allow it over the next days.
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #80 on: March 16, 2019, 04:25:30 PM »
All files have been uploaded anew so far. The Khmer files still need some remaining replacements of XML codes. Renamed files have been deleted.

Once the index is rebuilt, the last replacements can be made.

As for replacing the placeholders {file}, {ns-section}..., it may be good to run similar scripts on the server.

Regarding {no}: no overall idea for now, so it may be best left as before.

Attached is an Excel list containing all particular replacements for each single file.
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #81 on: March 17, 2019, 11:41:59 AM »
List of the renamed index files (toc.xml): renaming_files#index-files_toc
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #82 on: March 23, 2019, 11:45:38 AM »
Main indexes in the four scripts should be fine and complete now:

Tipiṭaka (Roman)
តិបិដក (បាឡិ​ខ្មែរ) ติปิฎก (Thai) д̇ибидага (кириллица)
My person is currently trying to rebuild the index with the update option, which seems to be about twice as slow as building it anew, but possibly would not end with no index at all when stopped in between (about 3000 of 20500 pages indexed since this morning).
Offline Johann

from: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #83 on: April 01, 2019, 12:23:19 PM »

Aramika   *

One or more posts have been cut off from this topic and opened as, or appended to, the new topic "[ati.eu] Indexing, search engine".
« Last Edit: April 01, 2019, 12:29:19 PM by Johann »
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #84 on: April 02, 2019, 08:18:28 AM »
{lang} and {ns-section} have now been replaced on all pages except the 416 pages in cs-th (Thai: 268 pages in Atthakatha and 148 pages in Tika).

The further replacements ({file}, {path-source}...) could be made according to the list above, either page by page or with a script using the list. {file+}, {file-} etc., however, may need further rendering later. {no}... the same.
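A script using the list could look roughly like this Python sketch. It assumes the Excel list has been exported as CSV with the columns page, placeholder and value; these column names, the page name and the values are invented for illustration:

```python
import csv
import io
from collections import defaultdict

# Hypothetical CSV export of the list: one row per page and placeholder.
LIST_CSV = """page,placeholder,value
sut.dn.01,file,sut.dn.01
sut.dn.01,path-source,cs-rm:sut:dn
"""

def load_replacements(handle):
    """Read the list into {page: {placeholder: value}}."""
    table = defaultdict(dict)
    for row in csv.DictReader(handle):
        table[row['page']][row['placeholder']] = row['value']
    return table

def fill(text, values):
    """Replace each {placeholder} literally; unknown ones stay as they are."""
    for key, value in values.items():
        text = text.replace('{' + key + '}', value)
    return text

table = load_replacements(io.StringIO(LIST_CSV))
page = '[[{path-source}:{file}#{file}|source]] {no}'
filled = fill(page, table['sut.dn.01'])
print(filled)  # [[cs-rm:sut:dn:sut.dn.01#sut.dn.01|source]] {no}
```

Because the braces are part of the search string, {file} does not touch {file+} or {file-}, and {no} is left for a later pass.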

Sadhu for the great work and assistance of many in bringing the first four languages here and making them available for the Sangha and those with Nissaya.

Atma will take care of the last XML conversions into ati syntax in the Khmer pages and then look after the CSS for "good" layouts.

An Excel file that helps in creating the release files, also for languages to come, can be used: renaming_list.xlsx. To extract them into directories and files for an upload, the "Converting lists into txt-files - Tools for Ati.eu" can be used.
« Last Edit: April 07, 2019, 01:24:01 PM by Johann »
Offline Johann

Re: [ATI.eu] CSCD xml to ati.eu format: converting, editing
« Reply #85 on: April 11, 2019, 12:38:23 PM »
Currently working on the "single-sutta release" files, which may take some time given about 40,000 headers, but would then also finally give values for the {no..} replacements (for links to them) in the source files.

Since making single files for Atthakatha and Tika would cause a huge number of files unless they are skipped, Atma thought of putting the related commentaries directly into the Sutta (Mula) files.