forループ内のifステートメントを回避しますか？

Question

次のような関数Writerを持つwriteVectorというクラスがあります。

_void Drawer::writeVector(vector<T> vec, bool index=true) { for (unsigned int i = 0; i < vec.size(); i++) { if (index) { cout << i << "	"; } cout << vec[i] << "
"; } } _

パフォーマンスを心配しながら、コードが重複しないようにしています。関数では、結果が常に同じであっても、for-ループのすべてのラウンドでif (index)チェックを実行しています。これは「パフォーマンスを心配する」ことに反対です。

for- loopの外側にチェックを配置することで、これを簡単に回避できます。ただし、重複するコードを大量に取得します。

_void Drawer::writeVector(...) { if (index) { for (...) { cout << i << "	" << vec[i] << "
"; } } else { for (...) { cout << vec[i] << "
"; } } } _

したがって、これらは両方とも私にとって「悪い」ソリューションです。私が考えていたのは、2つのプライベート関数です。1つはインデックスを使用し、もう1つは呼び出します。もう1つは値のみを出力します。しかし、私のプログラムでそれを使用する方法を理解することはできません、私はまだifチェックを呼び出してどれを確認する必要があります...

この問題によると、多型は正しい解決策のようです。しかし、私はそれをここでどのように使うべきかわかりません。この種の問題を解決する好ましい方法は何でしょうか？

これは実際のプログラムではありませんこの種の問題をどのように解決するかを学ぶことに興味があります

Ali · Accepted Answer

ファンクタとしてループの本体を渡す。コンパイル時にインライン化され、パフォーマンスが低下することはありません。

さまざまなものを渡すという考え方は、C++標準ライブラリではどこにでもあります。これは戦略パターンと呼ばれます

C++ 11の使用が許可されている場合、次のようなことができます。

#include <iostream> #include <set> #include <vector> template <typename Container, typename Functor, typename Index = std::size_t> void for_each_indexed(const Container& c, Functor f, Index index = 0) { for (const auto& e : c) f(index++, e); } int main() { using namespace std; set<char> s{'b', 'a', 'c'}; // indices starting at 1 instead of 0 for_each_indexed(s, [](size_t i, char e) { cout<<i<<'	'<<e<<'
'; }, 1u); cout << "-----" << endl; vector<int> v{77, 88, 99}; // without index for_each_indexed(v, [](size_t , int e) { cout<<e<<'
'; }); }

このコードは完全ではありませんが、アイデアは得られます。

古いC++ 98では、次のようになります。

#include <iostream> #include <vector> using namespace std; struct with_index { void operator()(ostream& out, vector<int>::size_type i, int e) { out << i << '	' << e << '
'; } }; struct without_index { void operator()(ostream& out, vector<int>::size_type i, int e) { out << e << '
'; } }; template <typename Func> void writeVector(const vector<int>& v, Func f) { for (vector<int>::size_type i=0; i<v.size(); ++i) { f(cout, i, v[i]); } } int main() { vector<int> v; v.Push_back(77); v.Push_back(88); v.Push_back(99); writeVector(v, with_index()); cout << "-----" << endl; writeVector(v, without_index()); return 0; }

繰り返しますが、コードは完璧にはほど遠いですが、それはあなたにアイデアを与えます。

Marc Claesen · Answer

関数では、結果が常に同じであっても、forループのすべてのラウンドでif（インデックス）チェックを実行しています。これは「パフォーマンスを心配する」ことに反対です。

Ifこれが事実である場合、分岐予測子は（一定の）結果を予測するのに問題はありません。そのため、これは最初の数回の反復で予測ミスの軽度のオーバーヘッドを引き起こすだけです。パフォーマンスの点で心配することはありません

この場合、明確にするためにテストをループ内に保持することを推奨します。

syam · Answer

完全に正しいが、まだいくつかのコードを複製しているALiの答えを拡張するには（ループ本体の一部、これは戦略パターンを使用する場合、残念ながらほとんど回避できません）...

この特定のケースでは、コードの重複はそれほど多くありませんが、それをさらに減らす方法があります。これは便利です関数本体が数命令よりも大きい場合。

重要なのは、コンパイラの機能を使用して定数の折りたたみ/デッドコードの除去を実行することです。 indexの実行時の値をコンパイル時の値に手動でマッピングし（この例では2つ）、ケースの数が限られている場合に簡単にマッピングし、型のないテンプレートを使用することにより、コンパイル時に知られている引数：

template<bool index = true> // ^^^^^^ note: the default value is now part of the template version // see below to understand why void writeVector(const vector<int>& vec) { for (size_t i = 0; i < vec.size(); ++i) { if (index) { // compile-time constant: this test will always be eliminated cout << i << "	"; // this will only be kept if "index" is true } cout << vec[i] << "
"; } } void writeVector(const vector<int>& vec, bool index) // ^^^^^ note: no more default value, otherwise // it would clash with the template overload { if (index) // runtime decision writeVector<true>(vec); // ^^^^ map it to a compile-time constant else writeVector<false>(vec); }

このようにして、2番目のコード例（outer if/inner for）と同等のコンパイル済みコードになりますが、コードを自分で複製する必要はありません。これで、writeVectorのテンプレートバージョンを必要なだけ複雑にすることができます。維持するコードは常に1つになります。

テンプレートバージョン（非型テンプレート引数の形式でコンパイル時定数を取る）と非テンプレートバージョン（関数引数としてランタイム変数を取る）がオーバーロードされることに注意してください。これにより、ニーズに応じて最も関連性の高いバージョンを選択することができ、どちらの場合でもかなり似た覚えやすい構文を使用できます。

writeVector<true>(vec); // you already know at compile-time which version you want // no need to go through the non-template runtime dispatching writeVector(vec, index); // you don't know at compile-time what "index" will be // so you have to use the non-template runtime dispatching writeVector(vec); // you can even use your previous syntax using a default argument // it will call the template overload directly

ivaigult · Answer

ほとんどの場合、コードはすでにパフォーマンスと可読性に優れています。優れたコンパイラーは、ループの不変条件を検出し、適切な最適化を行うことができます。コードに非常に近い次の例を検討してください。

#include <cstdio> #include <iterator> void write_vector(int* begin, int* end, bool print_index = false) { unsigned index = 0; for(int* it = begin; it != end; ++it) { if (print_index) { std::printf("%d: %d
", index, *it); } else { std::printf("%d
", *it); } ++index; } } int my_vector[] = { 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, }; int main(int argc, char** argv) { write_vector(std::begin(my_vector), std::end(my_vector)); }

次のコマンドラインを使用してコンパイルします。

g++ --version g++ (GCC) 4.9.1 Copyright (C) 2014 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. g++ -O3 -std=c++11 main.cpp

次に、アセンブリをダンプしましょう：

objdump -d a.out | c++filt > main.s

write_vectorの結果のアセンブリは次のとおりです。

00000000004005c0 <write_vector(int*, int*, bool)>: 4005c0: 48 39 f7 cmp %rsi,%rdi 4005c3: 41 54 Push %r12 4005c5: 49 89 f4 mov %rsi,%r12 4005c8: 55 Push %rbp 4005c9: 53 Push %rbx 4005ca: 48 89 fb mov %rdi,%rbx 4005cd: 74 25 je 4005f4 <write_vector(int*, int*, bool)+0x34> 4005cf: 84 d2 test %dl,%dl 4005d1: 74 2d je 400600 <write_vector(int*, int*, bool)+0x40> 4005d3: 31 ed xor %ebp,%ebp 4005d5: 0f 1f 00 nopl (%rax) 4005d8: 8b 13 mov (%rbx),%edx 4005da: 89 ee mov %ebp,%esi 4005dc: 31 c0 xor %eax,%eax 4005de: bf a4 06 40 00 mov $0x4006a4,%edi 4005e3: 48 83 c3 04 add $0x4,%rbx 4005e7: 83 c5 01 add $0x1,%ebp 4005ea: e8 81 fe ff ff callq 400470 <printf@plt> 4005ef: 49 39 dc cmp %rbx,%r12 4005f2: 75 e4 jne 4005d8 <write_vector(int*, int*, bool)+0x18> 4005f4: 5b pop %rbx 4005f5: 5d pop %rbp 4005f6: 41 5c pop %r12 4005f8: c3 retq 4005f9: 0f 1f 80 00 00 00 00 nopl 0x0(%rax) 400600: 8b 33 mov (%rbx),%esi 400602: 31 c0 xor %eax,%eax 400604: bf a8 06 40 00 mov $0x4006a8,%edi 400609: 48 83 c3 04 add $0x4,%rbx 40060d: e8 5e fe ff ff callq 400470 <printf@plt> 400612: 49 39 dc cmp %rbx,%r12 400615: 75 e9 jne 400600 <write_vector(int*, int*, bool)+0x40> 400617: eb db jmp 4005f4 <write_vector(int*, int*, bool)+0x34> 400619: 0f 1f 80 00 00 00 00 nopl 0x0(%rax)

関数の開始時に、値を確認し、次の2つのループのいずれかにジャンプすることがわかります。

 4005cf: 84 d2 test %dl,%dl 4005d1: 74 2d je 400600 <write_vector(int*, int*, bool)+0x40>

もちろん、これは、コンパイラーが条件が実際に不変であることを検出できる場合にのみ機能します。通常、フラグおよび単純なインライン関数に対して完全に機能します。ただし、条件が「複雑」である場合は、他の回答のアプローチを使用することを検討してください。