2つの既知の値の間の文字列を検索します

Question

たとえば、「morenonxmldata<tag1>0002</tag1>morenonxmldata」から「00002」という2つのタグ間の文字列を抽出できる必要があります。

C＃と.NET 3.5を使用しています。

Mehrdad Afshari · Accepted Answer

正規表現を必要としないソリューション：

string ExtractString(string s, string tag) { // You should check for errors in real-world code, omitted for brevity var startTag = "<" + tag + ">"; int startIndex = s.IndexOf(startTag) + startTag.Length; int endIndex = s.IndexOf("</" + tag + ">", startIndex); return s.Substring(startIndex, endIndex - startIndex); }

Aaron · Answer

 Regex regex = new Regex("<tag1>(.*)</tag1>"); var v = regex.Match("morenonxmldata<tag1>0002</tag1>morenonxmldata"); string s = v.Groups[1].ToString();

または（コメントで述べたように）最小サブセットに一致させるには：

 Regex regex = new Regex("<tag1>(.*?)</tag1>");

RegexクラスはSystem.Text.RegularExpressions名前空間。

Marc Gravell · Answer

遅延一致と後方参照を使用するRegexアプローチ：

foreach (Match match in Regex.Matches( "morenonxmldata<tag1>0002</tag1>morenonxmldata<tag2>abc</tag2>asd", @"<([^>]+)>(.*?)</\1>")) { Console.WriteLine("{0}={1}", match.Groups[1].Value, match.Groups[2].Value); }

Ozesh · Answer

2つの既知の値の間でコンテンツを抽出することは、後の場合にも役立ちます。それでは、なぜ拡張メソッドを作成しないのでしょう。ここに私がやっていること、短くてシンプルな...

 public static string GetBetween(this string content, string startString, string endString) { int Start=0, End=0; if (content.Contains(startString) && content.Contains(endString)) { Start = content.IndexOf(startString, 0) + startString.Length; End = content.IndexOf(endString, Start); return content.Substring(Start, End - Start); } else return string.Empty; }

Matinee LA · Answer

string input = "Exemple of value between two string FirstString text I want to keep SecondString end of my string"; var match = Regex.Match(input, @"FirstString (.+?) SecondString ").Groups[1].Value;

Nime Cloud · Answer

将来の参照用に、このコードスニペットを http://www.mycsharpcorner.com/Post.aspx?postID=15 で見つけました。異なる「タグ」を検索する必要がある場合、非常にうまく機能します。

 public static string[] GetStringInBetween(string strBegin, string strEnd, string strSource, bool includeBegin, bool includeEnd) { string[] result ={ "", "" }; int iIndexOfBegin = strSource.IndexOf(strBegin); if (iIndexOfBegin != -1) { // include the Begin string if desired if (includeBegin) iIndexOfBegin -= strBegin.Length; strSource = strSource.Substring(iIndexOfBegin + strBegin.Length); int iEnd = strSource.IndexOf(strEnd); if (iEnd != -1) { // include the End string if desired if (includeEnd) iEnd += strEnd.Length; result[0] = strSource.Substring(0, iEnd); // advance beyond this segment if (iEnd + strEnd.Length < strSource.Length) result[1] = strSource.Substring(iEnd + strEnd.Length); } } else // stay where we are result[1] = strSource; return result; }

Sedrick · Answer

データの前後でストリップします。

 using System; using System.Collections.Generic; using System.Linq; using System.Text; using System.Threading.Tasks; using System.Text.RegularExpressions; namespace testApp { class Program { static void Main(string[] args) { string tempString = "morenonxmldata<tag1>0002</tag1>morenonxmldata"; tempString = Regex.Replace(tempString, "[\s\S]*<tag1>", "");//removes all leading data tempString = Regex.Replace(tempString, "</tag1>[\s\S]*", "");//removes all trailing data Console.WriteLine(tempString); Console.ReadLine(); } } }

Tom · Answer

RegExなし、必須の値チェック

 public static string ExtractString(string soapMessage, string tag) { if (string.IsNullOrEmpty(soapMessage)) return soapMessage; var startTag = "<" + tag + ">"; int startIndex = soapMessage.IndexOf(startTag); startIndex = startIndex == -1 ? 0 : startIndex + startTag.Length; int endIndex = soapMessage.IndexOf("</" + tag + ">", startIndex); endIndex = endIndex > soapMessage.Length || endIndex == -1 ? soapMessage.Length : endIndex; return soapMessage.Substring(startIndex, endIndex - startIndex); }